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ABSTRACT 


The design of a sixteen-bit pipelined adder CMOS inte- 
grated circuit 1S presented. The adder is designed to 
Maximize throughput and to provide for testability. 


Tutorial material on CMOS design is also presented. 
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I. INTRODUCTION 

For several years the ability of systems engineers to 
design custom digital integrated circuits has been growing. 
The Mead and Conway design methodology described in 
Introduction to VLSI Systems [keft. 1)])7  Ssernree the systems 
engineer to be his own logic circuit designer. A prolifera- 
tion of computer-aided design (CAD) systems such as the 
MacPitts silicon compiler [Ref. 2], the chip layout language 
(CLL) {Ref. 3], the graphics editor Caesar {Ref. 4], and the 
Burlap hierarchical layout language [{Ref. 5] make it 
possible for the engineer to rapidly carry the Mead and 
Conway design methodclogy through to a final design. This 
includes iterative simulation and redesign to provide justi- 
fiable confidence in the final design submitted for 
Labn cation. 

Many of the techniques utilized in the Head and Conway 
methodology and most of the CAD tools are based on having 
the final design implemented in a technology that uses only 
one type of doping for the semiconductor material in the 
active region of the transistors. Because of their higher 
Switching speed, negatively dofed metal oxide semiconductor 
(NMOS) transistor technologies are generally used. 

Selection of an NMOS implementation technology does 
provide the systems engineer with a complete and proven 
methodology for the design of a very large scale integrated 
(VLSI) circuit and allows the use of many extensively tested 
CAD tools. Like any other design decision, selection of 
NMOS implementation brings with it some limitations. There 
are two primary problems associated with NMOS digital 


CIiEeCilLeESs. 


The first is the ultimate switching speed limitation. 
Though many NMOS VLSI circuits operate at clock rates in the 
Omco, 10 MHZ range, there are many applications requiring 
higher clock rates. The second problem is the dissipation 
of the relatively large amount of power consumed by NMOS 
fmegaital circuits. State of the art, commercially available 
NMOS VLSI circuits commonly have power consumptions in the 
micinity Of 3 to 5 watts. Ccnsiderable design effort is 
required to insure that the dissipation of this much energy 
by a chip measuring approximately 5 millimeters on a side 
does not alter the performance of the micron sized features 
oa the chip. 

One group of technologies that offers both increased 
Switching speed and greatly reduced power consumption is 
complementary metal oxide semiconductors (CMOS). CMOS 
circuits also offer the benefits of greater radiation hard- 
ening and increased noise margin. In this thesis investiga- 
tion, much of the Mead and Conway methodology was utilized 
mamene design of a CMOS circuit. A general purpose color 
graphics CAD tool called Caesar that has been frequently 
used in the design of NMOS circuits was employed. en 
carrying out the design of the 16 bit pipelined high speed 
adder in CMOS two separate goals were pursued. The first, 
of course, is speed and the seccnd is verifiability. A high 
speed adder implies not only a high clock rate of operation 
but also a small latency between input of operands and 
output of the sun. 

A discussion of CMOS technologies and the implementation 
of logic circuits in those technologies follows in Chapter 
Ze Chapter 3 presents a description of the CAD tools used 
to construct and simulate the layout for the adder. The 
logic and layout design of the adder is covered in Chapter 4 
and is followed by a test plan for the fabricated chip in 
Chapter 5. 


II. CMOS CIRCUITS 


Before the design of CMOS digital cCirecnuiteeseanweee 
attempted, an understanding of how to best implement logic 
functions in CMOS is necessary. It is also important to be 
aware of the advantages and disadvantages of the different 
CMOS igplementation technologies. In tnis chapter the oper- 
ation of CMOS digital circuits is explained using Siniiae 
NMOS circuits as a benchmark for comparison. The different 
methodologies for assembling the CMOS pieces to produce the 
desired logical results are reviewed and the selection of 


the CMOS-Bulk p-well implementation technology is explained. 


A. COMPARISON WITH NMOS 


In NMOS digital circuits there is only one type of 
Switching device, namely the n-channel enhancement mode 
metal oxide semiconductor (MOS) transistor. The other prin- 
Cipal device utilized in NMOS circuits is the depletion node 
n-channel MOS device which acts as a load resistor. In CMOS 
there are both n-channel and p-channel enhancement mode 
transistors available. As in NMOS, the n-channel device can 
be considered on when Vdd (typically +5 Volts DC), a logical 
1, 1S present on its gate. The p-channel device can be 
considered on when grcund (GND), a logical 0, is present on 
its gate. In Figure 2.1 are the symbols that will be used 
for the n-channel and p-channel transSiStors in this encom. 

The basic differences between NMOS and CMOS technoiogies 
can be demonstrated by comparing their application to some 


Hasic digital Cirreyur.te 
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gate Saee 


p-channel 


= GND 


a | 


PlLguree. | CMOS Transistor Symbols. 


n-channel 


Figure 22 (a) shows ~an NUOS seam verter . Whenever 
there is a logical 1 on the ingut, the voltage drop across 
the lcad resistor is approximately Vdd and the output is a 
togaical 0. This results in steady state power consumption. 
When the input switches to a legical 0, before the output 
can assume a logical 1, the lcad capacitance (Cl) on the 
output must be charged to Vdd through the load resistor with 
a resistance of several kilohms. This results in a much 
donger transition frem 0 to 1, where the load capacitance is 
charged through the load resistor, than from 1 to 0 where 
the load capacitance is discharged through the switched on 
NMOS enhancement transistor. The reason for this asymmetry 
is that the pull-down transistcr's on resistance is typi- 
cally only one fourth or less that of the on resistance of 
the pull-up load depletion mode transistor. The technigue 
Geese precharging circuits, where all outputs are set to 
logical 1 during one clock cycie and then selectively forced 
to 0 on the opposite (evaluation) clock cycle has proven 
helpful in gaining control over the unsymmetric switching 
times. This longer switching time from 0 to 1 must stiil be 
accounted for, however, and represents the primary limita- 


tion to the speed of NMOS circuits. 
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Figure 2.2 (a) HMOS Inverter (b) CHOS Inverter. 


In the CMOS inverter of Figure 2.2 (b) the input is 
applied to the gates of both devices. An input of logical 1 
causes the n-channel device to switch on and the p-channel 
device to switch off, resulting in an Output of Logica 
Similarly, an input of 0 results in an output of 12) ino 
cases, one device is fully off, representing a resistance on 
the order of gigaohms. TAS, the steady state power 
consubftion is essentially zero. In operation the only 
power consumption of consequence occurs during the tran- 
Sition when neither transistor is fully on Or ODnE 
Additionally, Since the output load capacitance is both 
Charged and discharged through a turned on transistor, the 1 
to 0 and 0 to 1 switching delays are theoretically the same. 

Actually the switching delays depend on many parane- 
ters. The n-channel and p-channel device dimensions are 


frequently not the same, the robility of the electrons in 


12 


the n-channel is greater than the mobility of the holes in 
the fp-channel. Also, the capacitive load seen by the 
p-channel device in CMOS p-well (CMOS-pw) is greater than 
the load seen by the n-channel device because of the highly 
doped p-well. Typically, the result in CMOS-pw is a 
Siigmutily LONGeretbanSition timesot the 0 to 1 output tran- 
sition. some designers attempt to compensate for this by 
consistently making the p-channel transistors wider than the 
n-channel transistors. 

Un likeweneo, tne OUtputeor a CMOS digital circuit 
makes a full excursion between Vdd and GND. This makes CHOS 
circuits less sensitive to noise than NMOS circuits. CHGS 
should also benefit more from future reductions in feature 
size. NMOS is more restricted in ultimate feature size 
because the power disSipation requirements of the depletion 
mode devices will create more problems as feature sizes 
shrink. In Figure 2.3 the relative sizes of minimum dinen- 
Sion inverters implemented in currently available 3 micron 


feature size CMOS-PW and NMOS technologies are shown. 


2. The NOR Gate and Transmission Gate 


Figure 2.4 shows the circuit diagrams and layouts of 
a two-input NOR gate implemented in both CMOS-PW and NMOS. 
From Figures 2.3 and 2.4 it is evident that static! CMOS 
gates are more complex and aréa consuming than their Nios 
counterparts. In these fully complementary circuits a 
redundancy in the structures 1s evident. Thee PU 0p ominy, 
or pulil-dcewn only would be sufficient to implement the 
BOgic. ime tcnemenOSm CliCcults of Figures 2.3 and 2.4 the 
inputs must perform two tasks. A logical 71 on an input 


causes both a connection between the output and ground anda 





: Bota ttG  bOqic Gircuits .continuously evaluate their 
inputs and produce their specified logic output. Dynamic 
SeAECuits porte logical evaluation of the inputs only when 
directed to do so by control signals and/or clock signals. 
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Figure 2.3 Minimum Dimension Inverters. 


disconnection between the output and Vdd. Logically these 
two actions are eguivalent, therefore only one action should 
be necessary to implement the logic. Design nethodologies 
to accomplish this are described in section B of this 
Chapter: The parallelism of the CMOS transmission gate of 
Figure 2.5 and the NMOS pass transistor 1S evident. the 
major difference lies in the tilateral nature of the CHOS 
transmission gate. It 1s made up of both a-channel and 
p-channel devices and reguires both polarities of the 
control signal for operation. The reason for this bilateral 
reguirement is that the p-channel device does not transmit 


low voltages well and the n-channel device does not transmit 
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Figure 2.4 2-input Nor Gate. 


high voltages well. The resulting unpredictable voltage 
drops make it necessary to utilize both types of transis- 
tors. This increase in complexity over its NMOS counterpart 
is partially offset by the absence of the level restoring 


circuitry NMOS reguires following a pass transistor. #2 


>. ee a a: a ee ee 


peo NMOS digital circuits the length to width ratio of 
the pull down ransistor is usually four times that of the 
depletion mode transistor load. This ratio is reguired to 
Beoeure Sufficient excursion of the output voltage. However, 
after a pass transistor is used, a ratio of 8:1 rather than 
4:1 must be used to restore the VGS threshold voltage drop 
across the pass transistor. 
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in Out 


Figure 2.5 CMOS Transmission Gate. 


In general CMOS technolcgies are ratioless. The use 
of "improper" ratios will not affect the logical joperaames 
of most CMOS gatés, it will only affect the speed of ofera- 


tion or the gates. 


B. CMOS DESIGN METHOLOLOGIES 


Static gate CMOS circuits have three serious deficien- 
cies when compared to static NMOS gates. First, they ame 
more area consuming. Second, they can be slower. Though 
the individual gates can be faster in CMOS, the p-channel 
and n-channel gates are in parallel, thus, the fanout?@age 
the output load capacitance of each citeuit are ~doupdeg 
Third, a CMOS static gate sis redundant, dup licatinGpiaae 
functionality in both the pull-up and pull-down sccer ace 

One approach to remedy these deficiencies 1s to usea 
static NNOS-like style of design as in Figure 2.6 Here the 
p-channel device is always on and the pull-up to pull-down 
dimension ratio is relied upon to produce the proper output 
voltage. This introduces power conSumption problems’ and 


takes away the full excursion on the output. Another 


3Fanout represents the numker of transistors that the 
output of a logic gate must drive. 
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Figure 2.6 NMOS-Like CMOS Static Gate [Ref. 6}. 


approach is to make extensive use of transmission gates to 
mia up logic functions. Using transmission gates means 
both polarities of all control signals are required. The 
resulting large number of wires reguired to route these 
control signals can become very area consuming, especially 
1£ only one metal layer is available. 

A third and more effective solution is to use dynamic 
mogic. Figure 2.7 contains three different implementations 
of a dynamic three- input NAND gate. In each, the output is 
meaningful (i.e. represents the value of the boolean expres- 
Sion ini in2 in3) only when clk is high and clk is low. The 
Circuits of Figure 2.7 (a) and (b) depend on the pull-up to 
puli-down ratio to produce the proper output. As with the 
NMOS-like style of design, full excursion on the output is 
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lost and there is steady state power consumption during the 
evaluation cycle: The circuit in Figure 2.7 (0G evo oe ee 
harged when clk is lew and evaluation of the inputs takes 
place when clk is high. This configuration allows only one 
change of the output trom ~1eeoeer so the inputs must be 
stable at the time clk goes high. A change of one of the 
inputs from 1 to 0 after clk has gone high cannot cause the 
output to return to 1. 

In general dynamic CMOS eliminates the redundancy of 
static CMOS by applying all inputs to one type of device and 
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Figure 2.7 Dynamig NAND Gates [Ref. 6]. 
a control signal to the other type of device. The Osis 


Fopular dynamic CMOS logic design technique is domino CKOS 
[Ref. 7], illustrated in Figure 2.8 Here the output is the 
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moguGdl AND Ofwethe boolean function (ini in2+ in3) to be 
implemented and a control (clock) Signal. When the clock is 


low, the circuit is frecharged, and when the ciock is high 






oe en 2 a 3 


elock 


Figure 2.8 Domino CMOS Structure [Ref. 6]. 


evaluation occurs. Hith a common clock shared by ali the 
domino gates ona chip, during the evaluation cycle the 
Sighals ripple through the chip as though the logic were 
purely static. The follow on inverter insures that the 
output of each gate is low when evaluation begins. This 
prevents the outputs of all gates from changing unless 
driven lew by the inputs. Domino CMOS is not always the 
Saeower Choughas If the logic of Figure 2.9 were implemented 
in domino CMOS it would be more area consuming than the same 


circuit implemented in static CMOS. Dynamic CMOS is more 
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area consuming in this case because tnese are Simple gates 
with only a few inputs. Each NCR gate if implemented stati- 
cally would need two n-channel devices and two p-channel 
devices. If implemented dynamically, each NOR gate requires 
three transistors of one type (one for each input and one 
for the control signal) and one transistor of the other type 
(for the control signal again). The number of transistors 
needed remains the same but the dynamic logic requires the 
designer to keep three inputs electrically isolated instead 
of just two. And if the dynamic design technigue is domino, 
Six additional inverters will be needed. As can be seen in 
Figure 2.4, in CMOS a NOR gate can be constructed from just 
one stage. Adding the follow-on inverter of the domino 
design results in an OR gate. Thus a second inverter is 


required to return the logic to that of a NOR gate. 


pee 


a 


Figure 2.9 Circuit Difficult to Implement in Domino CHOae 


C. CHOS ITNPLEMENTATICN 2EGHNGL Ceres 


One of the principal issues in the design of a process 
to implement CMOS digital circuits in silicon is Howse 
isolate the two types of devices. This can be accomplished 
by using a completely insulating substrate or through a more 


complex fabrication frocess. 


ZO 


ie, 6 CMNOS=SCs 


The only process currently offered Ey Metal-Oxide 
Semiconductor Implementation Service (MOSIS) which uses an 
electrically insulating substrate is Silicon on Sapphire 
fos). in this technology the n-channel and p-channel tran- 
sistors are formed on silicon islands left after etching an 


epitaxial layer of silicon on a Sapphire (Al,03) substrate. 
ae MOS-Bulk 


The other CMHNCS processes offered by MOSIS all use 
CMOS-Bulk p-well technology. The p-well processes differ in 
the number of layers of metal interconnections (1 or 2) and 
the presence or absence of capacitors. in Cues-Buuk p-well 
(n-well) the substrate is n-doped (p-doped) and the 
p-channel (n-channel) devices are in this substrate. oO 
isolate the n-channel ({p-channel) devices from the substrate 
a heavily doped p-well (n-well) is first placed to act as 
the back gate. The heavy dofing of the p-well (n-well) 
degrades the performance of the n-channel (p-channel) device 
while the p-channel (n-channel) device is optimized. Tena 
p-well CHOS, though the mobility of electrons in the 
n-channel device still exceeds that of the holes in the 
p-channel device, the performance difference of the transis- 
tors is f@inimized. The more uniform performance of the two 
transistor types makes the p-well process appropriate for 
Seyos Candom Logic. 

Figures 2.10 and 2.11 represent the top and. side 
views of the steps of the CMOS-fw process for the production 
of an inverter. These steps are: (1) starting with an 
n-type substrate the p-well is patterned, (2) The active 
areaS in the p-well and on the substrate are established, 
(3) the polysilicon is patterned, (4) the two ion implant 


masks are placed (the Nt mask is simply the photographic 


ZN 


negative of the P+ mask) , (5) contact Cuts a@bemaade,, ane 
(6) the metal is placed. 


as Latenupein ClGoapw 


Onew or the main problems associated with 
CMOS-Bulk, both p-weil and n-well is Jlatchup. Basically 


latchup involves generation of a short circuit between Vdd 


and GND, and can result in the complete destruction of a 
chips Many researchers have tried to formally define the 
conditions [Ref. 8] that cause latchup to oceume This tas 


ls extremely complex because the phenomenon is so dependent 
on layout, which is unigue to each chip design. Though a 
fully quantitative analysis of latchup is still not avaue 
able, a qualitative analysis will show what happens on the 
chip when latchup occurs. 

Looking at the side view of an inverter in 
Figure 2.12, parasitic bipolar transistors can be seen. ia 
base of the npn transistor is the p-well and the base of the 
pnp transistor is the n-doped substrate. These parasitic 
transistors are connected as shcwn in Figures 2z-sou. If the 
output of the gates goes below GND by a value egqual to the 
threshold of the apn transistor, its emitter starts ee 
inject current (electrons) inte the base (p-well) and the 
resultant collector current flows to the Vdd node. If the 
resistance between the Vdd ncde and the source of the 
pull-up p-channel MOS transistor, R1, is large enough, the 
voltage drop across R1 will exceed the threshold of the Enp 
transistor. The collector current (holes) of the pnp device 
flows to the GND node. If the resistance between the GND 
node and the source of the pull-down n-channel MOS tran- 
Sistor, R2, is great enough, the resultant voltage drop 
across R2 will increase the base current in the apn tran- 


Sistor. As is evident, there is positive feedback. 
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Figure 2.10 
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Figure 2.11 P-Well Process, Side View [Ref. 9]. 


24 


The only way to stop this destructive process once it has 
started is to disconnect Vdd or GND. Prevention of latchup 


must Ee designed in. 
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Bigure 2.12 BLpoOld:simansistcrs in CMOS-Bulk” [| Ref. 6]. 
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Figure 2.13 The Latchup Circuit [Ref. 6]. 


The MOSIS CMOS-Bulk p-well design rules include 


features for the Specific purpose of reducing the 
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probabil lit yeor mela te nur The tininum separation Eules es 
p-wells and P+ doped active areas exist for this purpose. 
Their aim is to reduce the gain of the parasitic bipolar 
transistors, thus requiring a larger noise spike of longer 
duration to start the latchup sequence. A frequently used 
technique is the grounding of the p-well as illustrated in 
Figure 2.14 . Here the effect cf the P+ doped area covering 
half of the contact cut for the ground bus is to reduce the 
resistance R2 in Figure 2.13 . Another practice 1s to place 
a small capacitor across the Vdd and GND pins of CMOS-Bulk 
eni psi To provide capacitive filtering of noise spikes on 
the chip, Vdd and GND busses are frequently run close 
together. Also, Vdd input pads are designed to provide 


capacitance between Vdd and GND. 
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Figure 2. 14 Grounding of the P-Well. 


This process, also called twin-well, uses both 


m-wells and p-wells on a high resistivity N= (ora 
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substrate, or in an epitaxial layer of silicon on a Pt or Nt 
wafer. Since the well doping does not have to overcome the 
substrate doping, both the n-channel transistors in the 
p-well and the p-channel transistors in the n-well can be 
optimized. Domino CMOS is enhanced by the use of this 
process Since the optimized n-channel devices can speed up 
the complex boolean expresSion evaluation and the optimized 
p-channel devices can speed up the signal drive between 


stages (thereby reducing the effect of a given fanout). 


D. CMOS TECHNOLOGY SELECTION 


The CMOS implementation technologies available fron 
MOSIS are CMOS-Bulk p-well with one metal layer, CMOS-Bulk 
powell with two metal layers, CMOS-Bulk p-well with two 
metal layers and capacitors (for analog circuits) and 
CMOS-SOS. | 

The advantages of CMOS-Bulk are: (1) very good noise 
margin, (2) faster than NMOS, and (3) a proven reliable 
fabrication process. Its disadvantages are: (i) Latchug 
susceptibility, (2) use of p-well guard rings is needed if 
radiation hardening is desired, (3) lower circuit density 
than NMOS or CMOS-SOS, and (4) more complex design rules 
than either NMOS or CMOS-SOS. 

The advantages of CMOS-SOS are: (1) faster than NMOS or 
Sa0s—-Bulk, (2) very good noise margin, See nerensi ca lly 
radiation hardened, and (4) no latchup. Its disadvantages 
are: (1) expensive fabrication process due to the sapphire, 
(2) Sapphire variability reduces the reliability of the 
fabrication process, (3) thermal mismatch between the 
Sapphire and silicon limits the carrier mobility, and (4) it 
is not a viable technology for dynamic memory due to back 


Channel leakage. 


Zi, 


CMOS-Bulk p-well was selected as the implementation 
process for the adder for the following reasons. Filtstee 
technology files for this process were available at the 
Naval Postgraduate School (NPS) enabling the use of extant 
computer aided design (CAD) tools. Second, since this would 
be the first CMOS VLSI design at NPS, utilizing the moee 
reliable process is prudent to prevent design problems from 


bkeing clouded by implementation process problems. 
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III. DESIGN TOOLS 

To employ the Mead-Conway design methodology on a large 
scale design, three computer aided design (CAD) tools are 
needed. A layout design editor for viewing the circuits as 
they are created is the first tcol required. Next, a design 
rule checker is necessary to confirm that all the design 
rules for the specified technology have been adhered to. 
Though not a complex task, the large number of checks that 
must be made for even a modest design makes manual design 
rule checking highly error prone. Pindall yeaa Glreult sinu- 
lator is needed to verify that the circuit as designed 
provides the proper logical output. In the design of the 
Sixteen-rit pipelined adder, the Caesar layout editor 
[Ref. 4], the Lyra design rule checker [Ref. 10], adeGe 


Terman's RNL circuit simulator [Ref. 11] were employed. 


ae CAESAR 

Caesar is a generic layout editor. It is not designed 
for any particular VISI implementation technology. Teens 
not even limited to designing integrated circuits. Caesar 


1s a graphics layout editor for the creation and nanirula- 
tion of rectangles where the user specifies the color, size, 
and placement. It is through the user specified technology 
file that the rectangles of colcr take on meaning. At the 
Naval Postgraduate School (NPS) there are two technology 
files available for use with Caesar. One is for N-doped 
metal oxide semiconductors (NMOS) and the other is’ for 
complementary metal oxide semiccnductors utilizing a P-doped 
well (CMCS-pw). 


BS, 


Caesar works with files cf its own Special format. 
These file are indicated by an appended file type of ca(i.e. 
Soe Cay). On command Caesar will generate a Caitech 
Intermediate Format (CIF) file cf the same layout. Again it 
is the technology file which tells Caesar which CIF layer 
labels to attach to the colored rectangles. 

At NPS, Caesar is set up to take commands from any 
terminal where the execution of the Caesar program is initi- 
ated (usually the ADM-3a console adjacent to the color 
graphics display unit) and from a four-button puck yam 
graphics tablet attached to the color display device. 
Caesar displays its graphics results on an AED 767 color 
monitor and displays its menus, messages, and prompts on the 
command console. Detailed information on the installation 
and operation of Caesar at NPS can be found in Reference 4 
and Reference 2. 

Caesar 1S an interactive CAI tool. The results of any 
command are rapidly displayed on the AED 767. The resuits 
of a ccmmand may be undone (u) cr repeated (.) with a single 
stroke of the specified key on the command console. While 
running Caesar, a user may also call upon the design rule 
checker, Lyra, to check the area inside and within three 
Caesar units* of the current box for design rule violations. 
This interactive use of the layout graphics display and the 
design rule checker helps to insure that there will not be 
any design rule forced changes late in the design cycle whken 
changes are much more time consuming. With Caesar's level 
of interaction with the designer, the design loop consisting 
of (1) issue commands to perturb existing circuit, (2) 


visual inspection to verify command's generation of desired 





*A Caesar ooo ae is layed out on a grid of Caesar units. 
These units do no represent any speciitrze eigen a#hen 
creating a CIF file from a Caesar file the desired length of 
a Caesar unit 1s specified. 
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Beemwhes, ang {3) design tule Ghecking of new circuit, can be 
rapidly completed. 

Caesar is a hierarchical design tool. With Caesar, 
circuits can be created by piecing together cells (other 
files of type .ca) which in turn may be made up of other 
sub-cells. Theoretically, there is no limit to the number 
of levels in the hierarchy. Nct only can cells (sub-cells, 
etc.) be called upon to fill locations in a circuit, if they 
need to be modified to function properly, Caesar provides a 
subedit mode to facilitate editing of layouts one level 
below the current editing level. Care must be taken when 
this subedit feature is used since the changes made to the 
cell are giobal. Everywhere the given cell is used on the 


chip, the newly edited version will appear. 


B. LYRA 


Like Caesar, Lyra is a generic design rule checker. 
When Lyra is invoked from within Caesar, the actual prograz 
executed to check for design rule errors depends on the 
technology file indicated in the header of the Caesar file 
being edited. After running, Lyra sends a message to the 
command console indicating the number of errors found. On 
the graphics display Lyra paints the exact location of each 
error and labels each error with the design rule violated. 
The error label consists of akbreviations for the layers 
involved, followed by an underscore, followed by an abbrevi- 
ation for the type of violation detected. Table 1 lists the 
abbreviations used by Lyra for CMOS-pw. 

The winter 1983 distribution of the University of 
California at Berkeley (UCB) CAL tools included two versions 
Sree yra. One for the Mead-Conway NMOS design rules and the 
other for the Jet Propulsion Lakoratory's (JPL) five-micron 


feature size CMOS-pw design rudes. Since MOSIS no longer 


31 


TABLE 1 


Lyra Error Abtreviations 


Layer. Abbreviation ._ , SEEOL | 
poiysilicon Dp w minimum width 
neta 1 Ss minimum separation 
p-well W ie malformed transistor 
ne Gieerucaon d 
Cutts ; C 
E+ diziusion P 


Supports fabrication o£ the GEL CNS [aye eoc-ce design 
rules for the MOSIS supported three-micron CMOS-pw process 
were obtained. Professor Marco Annatarone at 
Carnegie-Meilon University (CNU)generated the listing of the 
three-micron CMOS-pw design rules compatible with Lyra and 
has provided NPS with a copy. To generate executakrle code 
from the prototype lyra program and imbed the specific 
process design rules, the program rulec (see Appendix B) is 
run with the design rule list file as its argument. 

Now, when Lyra is invoked from Caesar while editing a 
CMOS-pw technology circuit, the three-micron minimum feature 
Size CMOS-pw design rules are applied. This version of Lyra 
does not check for exceeding any maximum dimensions. The 
only maximum size design rule in this technology is for 
contact cuts, which may not exceed 3 microns by 3 micEoues 
Avoidance of improper contact cuts can be accomplished by 
utilizing Caesar's hierarchical nature. Contact cuts Ofmaum 
needed sizes and types are generated once and saved to be 


inserted as cells wherever needed. 


C. SIMULATION 


Once a circuit layout has completed this initial design 
loop, it matches the designer's conception of how it should 
appear and is free of design rule violations. The perforn- 
ance of the given circuit, though, remains uncertain. TO 
Simulate the performance of the design, programs such as 
SPICE [Ref. 11] and RNL [Ref. 11] are used. 
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1. SPICE 


SPICE is an important Simulation tool in the design 
S= hrgn Speed CMOS digital and analog circuits. Wo Chea tS 
detailed device modeling, SPICE can provide accurate 
predictions of performance once the device parameters of the 
implementation technclogy are known. SPICE provides’ the 
logical output of a circuit based upon the inputs and 
describes the transient behavior of the circuit as it 
changes to the new logical output. Thus SPICE enables a 
designer to optimize transistor dimensions for speed. 

Unfortunately, the version of SPICE currently avail- 
able on both the Vax 11-780 and the IBM 3033 at NPS (version 
2G6) fails when the parameters of the devices fabricated by 
the MCSIS three-micron CMOS-pw frocess are used. With these 
parameters the transient behavicr solutions do not converge. 

Engineers at CHU, UCB, and the University of 
Washingtcn (UW) are currently employing an experimental 
version of SPICE (version 2X.x developed at UCB) which is 
successful Simulating with the three-micron CMOS-pw device 
parameters. This version, however, has other bugs and is 
therefore not available for general distribution. The 
changes to SPICE 2G6 that enable SPICE 2X.x to simulate the 
three-micron CMOS-pw devices will be incorporated into the 
next distribution of SPICE {version 2G/7). The Naval 
Postgraduate School is in the queue of institutions to 
receive SPICE ZG7 once it is ready. 

Mmorder tO ELUM a SPICS Simulation of a CMOS circuit 
designed using Caesar, the following steps should be 
executed. First, the labeling feature of Caesar is used to 
place dlabels on the electrical nodes of interest in the 
Smecuit (Vdd, GND, input, OQUSPUTwectecs.).) Second, the Caesar 
command 

See dee 1 OO 21p 
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is issued to generate tne basename.cif file. The parameter 
100 indicates a scale of 100 céentimicrons per Caesar unit5 
and must be specified unless the default value of 200 
centimicrons per Caesar unit is desired. The -p optwom 
causes entries to be made in the basename.cif file for the 
labels assigned. Third, after exiting Caesar and returning 
to Unix, the circuit extractor Mextra [Ref. 10] is invoked 
using the command 


% mextra basenanme 


to create the file basename.sim. To modify the basename. sin 
file to a SPICE file (basename.spice), the program sim2spice 
{[Ref. 11) is used. The basenare.spice file contains a list 
of transistors and capacitors in the circuit in a  SPiee 
compatible format. 

The basename.sfice file must be edited to add the 
model parameters for the transistors, to specify the wave- 
forms of the input(s), to specify the type of analysis to be 
performed (usually transient analysis) andto specify the 
output to be produced (tables, graphs, etcoj@ The Spice 
User's Manual [Ref. 11] contains the formats of these addi- 
tions to basename.spice. Best case and worst case device 
model parameters for the MOSIS three-micron CHWOS-pw process 
as compiled by Dr. M Annaratone of CMU and Dr. tL. Glasser 


of MIT are round in Appendix A: 


Ze CRNE 


RNL 1s a timing and logic simulator for digital wigs 
Climo suse It is an event driven simulator which uses a 
resistance-capacitance model of a circuit to estimate node 


transition times and to estimate the effects of charge 


SSince the minimum dimensicns for the. 3-micron CMOS-pw 
process are specified in microns instead of lambda, CMOS-pw 
circuits are usually designed on Caesar uSing one micron per 
Caesar unit. 
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sharing.® After input values have been assigned by the user, 
RNL calculates the effects of those inputs Dy repeating the 
following operations until there are no further node value 
changes: (1) when anode is added to the network due toa 
transistor being turned on, the charge sharing implications 
of the new node's capacitance and logic state on each of its 
electrical neighbors is computed, (2) for each node that 
might be affected, Vthev and Ethev (the parameters of the 
Thevenin equivalent circuit) are calculated andthe new 
logic state is determined from Vthev (0.0Vdd to 0.3Vdd = 
Mogte) 0, 0.8Vdd to 1.0V¥Vdd = logic 1, logic X otherwise), (3) 
if the node has changed state, the transition time is calcu- 
lated using the node's capacitance, and (4) any changes are 
propagated to other nodes. Details of the computation 
methods used by RNL can be found in the RNL Version 4.2(U¥W) 
User's Guide [Ref. 11]. More important to the user is an 
understanding of what information RNL_ keeps, what it 
discards, and how it decides what to do next. 

Basic to the operation of RNL 1s the idea of an 
event. The three elements of an RNL event are; (71) a node 
in the network, (2) a new logic state for the node, and (3) 
the time when the node value changes to the new logic state. 
RNL maintains a list of events, sorted by time, that tells 
what processing remains to be done. When the user changes 
an input, an event is added to the list. RNL sequentially 
processes the next event on the list, stopping when (1) the 
list is empty, (2) a node the user is tracing changes value, 
or (3) when the specified simulation time interval has 
elapsed. To process an event, ENL removes it from the list, 


changes the node's state to reflect its new value, and then 


SS ae SS cee ae ee se ce ee oo 


6Charge sharing refers to the capacitive. effects that 
happen when two or more previously unconnected nodes each 
aoe 2 2g scme charge and capacitance, become connected by a 
MeaetlstOL {transistor turning on).. 
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calculates any new events resuiting from the node's trew 
value. 

In caiculating new events, first all nodes thaw 
Might be affected by the change are found and marxed. This 
includes the source and drain cr all transistors £oOr Wigs 
the current node is the gate and all nodes connected to 
these nodes through turned on transistors. The search 
through the network stops when a non-conducting transistor 
Or ah input is reached. For each marked node, two calcula- 
tions are made. First, a charge sharing calculation is 
performed to model changes of state due to the charging and 
discharging of node capacitances. Second, a £inal vai 
calculation is done to determine the node's ultimate logical 
state. 

A given node can have only two events pending: (1) a 
charge sharing event describing an immediate change in the 
node's state due to charge redistribution among the nodes on 
the connection list, and (2) a final value event describing 
tne. fina, driven state of the node. RNL observes the 
following rules for processing events: (1) when a new charge 
sharing event is scheduled, throw away all previously 
pending events for the node, and (2) when a new final value 
event is calculated, it will be ignored if (a) there is a 
pending final event for the same value which is scheduled to 
occur sooner, (b) there is a pending charge sharing event 
for the same value as the new final event, or (c) there is 
no charge sharing event and the new final value event is the 
Same as the node's current value. These rules are based on 
the assumption that the event that was last calculated 
reflects the latest configuraticn of the network and there- 
fore should override events calculated earlier. Charge 
Sharing events discard any ending final value events 
because any charge sharing calculation is immediately 


followed by a new final value calculation. 
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These event rules, however, sometines lead RNL to 
generate incorrect results. This is especially true of 
Signal driven circuits (circuits where inputs are applied to 
the source and drain of a transistor as well as its gate) 
and circuits that depend on the analog properties of the 
devices to predict the behavior of the circuit. BOE 


example, consider the first exclusive OR gate design for the 


ae 
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Pigure 3.1 CMOS Exclusive OR [Ref. 6]. 


pipelined adder in Figure 3.1 This design has proven to 
function correctly at CMU, however, the RNL simulation shows 
fees Cicrcuit failing. 

Starting in astate where A=0, B=1, and out=1, 
assume that the input A then transitions to 1. inet aa) y 
Sees, C4, and Q6 are on. When input A goes high, Q3 is 
turned off (no events generated) and Q2 is turned on, gener- 
ating a charge sharing event and a final value event for 


a 


Abar resulting In Abart gorude ve. When Abar goes low, the 
still turned on Q6 is now trying to drive the output node 
low and the still turned on Q4 (RNL recognizes that it takes 
a finite amount of time for Q4 to turn off Eut does not 


recognize that n-channel transistors do not conduct fiagm 


voltages well) is still trying to drive the output ede 
barges The result 1s an output of X, the undefined state. 
Next, Q4 is turned off. Since turning off Q4 adds no new 


nodes to the network, the event list 1s empty and the output 
remains at X. The primary difficulty RNL has with this 
circuit centers around the fact that the output node is 
controlled by two nodes that can change at different times. 
As a result, a charge sharing event due to one input can 
eliminate a final valve event of the other, with that final 
value event being the force which determines the circuit's 
actual behavior. 

The circuit cf Figure 3.2 1s a proven latch desiem 
which also fails in BNL simulation. In Figure 3.2 the {race 
tions next to the transistors represent the length to width 
ratios of the devices. This circuit is dependent on these 
ratios fcr proper operation. These ratios insure that the 
gain of the input signal on the gates of Q5 and Q6 is 
greater than the gain of the feedback signal to the same 
gates. RNL does not recognize the difference in these gains 
to be sufficient to cause the gates of Q5 and Q6 to be at 
elther logical 1 or 0 when the input signal is the opposite 
of the feedback signal. As arxresult, the circuit becomes 
locked uf at X. Because of RNI's difficulty with these two 
circuits, other designs were employed in the final adder 
(see chapter 5) to facilitate testing of the overall design. 

To use RNL as installed at NPS, the following steps 
should be followed. First latel the circuit and generate 
basename.cif as before. Again the program Mextra is used to 


extract the circuit, this time with the -o option (Mextra 


26 





Figure 3.2 CMOS Latch Design [Ref. 6}. 


basename -o). The -o option causes Mextra not to compute 
capacitances. A follow on program in this Sequence, Presing, 
performs this computation with greater accuracy. It should 


be noted that there are three different circuit extraction 


programs, each named Mextra. There is the MIT version, the 
UCB version and the UW modified UCB version. The next tool 
to be used in the seguence, Presim, can accept the output 


format of the MIT version and the UW modified UCB version. 
At NPS, the UCB version is installed and was used. The MIT 
and UW modified UCB versions differ in the order of the 
parameters in a transistor specification. Professor 
Annaratone at CMU developed a program, cformat, to change a 
°-Sim file generated ry the UCB version to the MIT format. 
However, cformat does not work if the -o option is used with 
Mextra. To avord a loss of accuracy, the .Ssim file can 
manually be changed to the UW modified UCB format. The 
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first step in this format cnange 4S tO USC thew t ee 
editor £0) addy ronan ac. UCB" to the header line of Lase= 
hame.sin. The other change that needs to be made is to 


Change the labels for the n-channel transistors from "na" to 


es Using the EX editor, the Following steps accomplish 
this: 

% e basenane.sin - invokes the editor 

; g/ D/S//e/g - make global change 


for all n aS firsStweena 
in a line, change toe 
: WwW - write back edited file 


og - €Xit sediftor 


The next step 1s to create a binary file for RNL 
from basename.sim using Presin. This is done by issuing the 
command: 


& presim basename.sim basename config 


Basename.sim is the edited .sim file and basename is the 
file into which presim writes its binary output. Config is 
the calibration file used to select other than default 
values for the circuit element capacitance and resistance. 
A copy of the presim user's guide from the UW/NWC VLSI 
Consortium release 2.0 and the calibration file used in 
Simulating the adder are contained in Appendix C. The 
values used in the calibration file are taken from the MOSIS 
supplied electrical parameters. 

The final step 1s to run RNL itself. This is done 
by entering one of the following two Unix commands: 

% rani Or 

%® cnl cmdfile 


where cndfile is the name of a file containing a sequence of 
RNL commands. Entering the first Unix command will cause 


RNL to take its commands directly from the console 
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interactively. If the second Unix command is used, speci- 
memgea commana fille, ANL first executes ail the Conmands in 
eiarwle and uponecomeletion, starts taking coumands fron tne 
console. In either case, RNL should be given the following 
commands: 

(ioad "uwstd.1") 

(load "uwsin. 1") 


(read-network "pas ename'") 


where basename is the file generated by presin. The first 
two commands load RNL with severai macros which simplify 
user interfacing with RNL. 

The user interface with RBNL is a LISP interpreter. 
The interpreter continuously executes the loop: (1) rceada 
command, (2) evaluate the command and perform the specified 
actions, and {3) print the result. There are two formats 
for specifying commands to this loop. fhe first is: 


(function argument argument ... argument) 


Here the parentheses delimit the command and spaces separate 

the elements. The interpreter reads the entire command, up 

to the closing parenthesis, then the first element is lLater- 

preted as a function and all the others as arguments. The 

arguments may be of the same command form, (function arg arg 

e-- arg). If the following command were issued to RNL, 
(cemec (te | (7 14 7 )) 


RNL would respond by typing 96 (12*4*2). The other format 
Zor commands to RNL is 


(function ‘(argument argument ... argument)) 


where the " ' " indicates the quote special form which keeps 
its argument from being evaluated. For example, (tS) 
evaluates to 5, but '(+ 2 3) is a string of three elements. 
When this second RNL command format is not used to represent 


an argument of another command (i.e. is not contained within 
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the parentheses of another command), 1t may De Welt cemeee 
the more natural form: 


Function argument argument 2.25 <newli ae 


Tutorials on RNL are contained in the University of 
Washington/Northwest ViSI Consortium's VLSI Design Tools 


Reference Manual [Ref. 11]. There are two points concerning 
the Mextra, Presim, RNL simulation cycle a user should be 
aware of that are not brought out in the documentation. fhe 
first concerns tke use of vectors in RNL connands. As 
evidenced in the tutorials of Reference 11and the adder 
Simulation results in Appendix D, vectors can be used to 
make the input and output of RNL less cumbersome and 
verbose. After the vector has been defined, a user will 
then want to assign values to it. The documentation shows 
the format of the vector vaiue assignwent command to be: 


(invec '(vecname valués)) 


However, tne "values" fieid has its own specific format. 
The first character should be a 0 or a 1 indicating posi tae 
and negative numbers, respectiveiy. The LISP interpreter 
will work with negative numbers but RNL wiil not accept 
negative numbers as logical inputs. The second character is 
a letter specifying the number base of the input vector (b 
for binary, h for hexadecimal). For example, to assign the 
binary value +101010 to the vector vectone, the RNL ccamand 
would be; 


(invec ' (vectone 05101010) } 


The other point concerns the location of imu 
labels on the input fads. When the entire chip is being 
Simulated, the input labels are normally placed on the metal 
pads where the off chip leads are attached. Sefore an input 
Signal from a bonding pad reaches the interior circuits of a 


chip it must pass through a resistor in an OVvVeEevVeieage 
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PEGECCTION GCleCult. fitness extraction and simulation 
process this resistor is viewed dsmean “Oppo <Ci.; cuit. 
Therefore, on input pads, the input label must be flaced 
after the resistor in the signal path. 

With Caesar, Lyra, and kENL, a designer at NPS has 
the requisite CAD tools for the complete logical circuit 
design loop. With these tools circuits that are free of 
design rule errors and produce the desired logical results 
can be designed. The lack of SPICE somewhat restricts the 
designer's ability to optimize speed, but there are several 
design technigues that can be employed to design chips that 


run fast. These will be covered in the next Chapter. 
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IV. DESIGN OF THE ADDER 

As stated in the introducticn, the primary yoals of the 
adder design are to maxinize throughput and to provide for 
testability. The adder is to te a pipelined adder. Every 
clock cycie it should accept as inputs two 16-bit addends 
(Al, the least Significant bit, through A16 and Bl, the 
least significant bit, through B16) and one Cazry-i aioe 
bit. It is desired to produce the 16-bit sum (S1 ,the least 
Significant pit, through $16) and the carry-out (Cout) gia 
as quickly as possible. Both the number of clock cyewes 
from input of the addends to the output of the sum and the 
duration of each clock cycle are to be minimized. A secon- 
dary consideration in the design is expandability. An 
expandable design is one that can easily be extended to 
produce a 32-bit or 64-bit sum utilizing the same circuit 
structures. In this chapter the logical design and layout 
design of the 16-bit adder will be presented. The equations 
presented in this chapter are taken or derived from equa- 
tions found in chapters three through six of The Logic of 
Computer Arithmetic by Flores [fef. 12]. In these eguations 
concatenation implies the logical AND, the symbol + implies 


the logical OR, and the symbol + implies the logical XOR. 


Ae. LCGICAL DESIGN 


In considering the speed spectrum of adders from a 
logical standpoint, at the fast end there is the table 
LOCK Up. Witn 33 binary inputs and 17 outputs, this woul 
require an address space of 233 17-bit words. With current 
technology this is not feasible. At the other end of the 


spectrum is the serial adder. On clock cycle 1 1t usesvaa® 
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Pipomdecln comme roduce SiwandeGiout (carry out of bit one 
pee Om tat Z ).. Gimelockeaycle 2 1t uses a2, E2,, anc Ciout to 
generate S2 and CZout. Here 16 clock cycles elapse before 
the sum is available. An adder can also be inplemented as a 
ripple carry aider where the duration of each clock fuise is 
Prt crene tO allow a Carry into the sum to propagate all 
the way through to a carry out. In the case of the 16-bit 
adder, this would reguire a cleck duration at least sixteen 
times the length of the gate delay of the one bit adder. 
The middle ground belongs to the carry look-ahead adder 
[Meets 3]. in carry look-ahead (CLA) addition the carry into 


each bit position, C(i), is generated from the proragate, 


Puy = AW O Bi) | foe ea) 
Giiy = Avy Buy (ecn 4.2) 
meme and generate, G(i), primitives. P{i) = 1 implies that 


Mearry into bit{i) wall. be propagated through to bit (i+1). 
Sey = «| implies that A (2) and B(1) will provide a carry 


into bit({i+1) of the sum, regardiess of the contents of the 
ee ee ee Om (ay) ray © (eGn 4.3) 
Si = COP) peas ed 


MmeeoeoigGhiticant bits of A and —£. The algorithm for the CLA 
Sum generation is as follows. The first event is the evalu- 
ation of eguations 4.1 and 4.2 to generate the P(i) and G(i) 
primitives. The second event uses the P(i) and G(i) prini- 
tives as inputs to equation 4.3 to generate the C{i)'s. The 
pada event is the computation of the S$(i)'s from ecuation 
a 
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As pointed ont by Flores [Ref. 12] and by Contagi yams 
Hauenstein [Ref. 3], there are several logical inplementa- 
tions of carry look ahead addition. A principal tasKiige 
this thesis investigation was to select a fast 1ogmeam 
design. Without the circuit simulator Spice, the analysis 
of each design considered was more gualitative than gquanti- 
tative. In this qualitative analysis, a turned on tran- 
Sistor is considered as a resistor with its resistance 
proportional to its length and inversely proportional to its 
width. All gates driven by such a turned on transistor are 
considered to be capacitive loads with Capacitance prope. 
tional to the area of the gate. The interconnect wiring is 
considered to add both parailel capacitive loading and 


series resistance as shown in Figure 4.1 







Rwire 


Cgatel es wall 


gate n 






Figure 4.1 CMOS Output Loading Model. 


From this model it is obvious that the amount of inter- 
connect wiring and the number of gates driven (fanout) 
should be minimized to minimize the output transition time 


when the fositions of switches 51 and S2 of Figure Go iam 
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reversed. This led to the following guideiines in the 
design of the adder: 
ieerne internal, logie of each stage should be acccn- 
plished with minimum dimension transistors , 3 microns 
Ce merous lengthh |Xewidth) . This leads to nore 
compact circuits with shorter interconnections and 
reduces the capacitive load on the preceding stage. 
meposobicicane.y wider transistors (3-micron x 9-micron) 
should be used at the output of each stage where the 
fanout and interconnect lcading is greater. 
3) The fanout of any transistor should be kept to less 
than five. 

This requires a more complete definition of fanout 
because the capacitive loading of a gate depends on its 
area. A Sol Menon xe —-mieron tClanSGistor driving Six other 
S-micron x 4-micron transistors nas a fanout of six. A 
3-micron x 8-micron transistor driving the same load is 
considered to have a fanout of three. Though this imrplies 
mete a high fanout problen can be solved bv _ wnerely 
increasing the width of the driving transistor, it negiects 
the efrects of the interconnect wiring. As gates are added 
to the load of a transistor, ¢acn subsequent addition must 
be more remote from the driving transistor. Since the 
resistance of the wiring is prceportional to its length and 
mavyersely proportional to its width, the resistance of the 
wiring will increase unless the width is also increased. 
However, Since the capacitance cf the wiring is proportional 
to its area, most of the gain achieved by widening the wire 
to reduce resistance is offset by the increase in capaci- 
tance. As a result, in the design of the adder, increasing 
the width of the driving transistor was not viewed as a 
complete f£1x for a fanout problen. 

For the comparison of the different approaches to CLA 


addition, the term logical event needs to be defined. The 
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most basic definition 1S a combinational Logic user re aes 
accepting a Ser Of “Tnpies, performing its specified opera- 
tions on those inputs and generating a set of outputs. 
Therefore, the input of the addends, followed by the compu- 
tation and output of the sum can be considered as a legical 
event. HoweveL, a primary design consideration for the 
adder is to provide for testarility and a key element of 
this provision is the availability of intermediate results 
(see section B of this chapter). This implies breaking up 
the sum generation into several separate events. The flpSe 
event taxes the addends as inputs, performs some logic oper- 
ation(s) on them and stores the results in a register. The 
next event takes its inputs from that register and stores 
its results in another register. This chain continues until 
the last event deposits the sum on the output pads of the 
chap. To provide the tester with easily interpreted inter- 
mediate results, the equations presented in this chapter 
were taken as boundaries for each logical event. The terms 
on the right side of the equation determine the inputs and 
the lert side terms determine the output of a logical event. 
Once all the inputs for an equation are generated by 
previous events, the logic of the eguation becomes part of 


the current event. 


This logic requires three events to generate the 
su. First, equations 4.1 and 4.2 are used to generate the 
P(i)'s and G{i)'s. Second, from equation 4.3 the C(i)'s are 
generated. Finally, the sum is derived from equation 4.4 
The principal problem with this approach for a sixteen-bit 
adder lies in the application of equation 4.3 Here, the 
input P(1i) has a fanout of 15, which makeS €his approgem 


unsatisfactory. 
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Noting that a four-bit sum generated using zero 
level CLA logic is within the design guidelines suggests 
cascading 4-bit slices of the same logic as indicated in 


Table 2 Here the sum iS available after six events and the 


TABLE 2 
First Level CLA Logic for a 16-bit Sun 


Event Bits Bits Bits Bits 
No. 1-4 2-6 i US 218s 
1 Compute Compute Compute aS mpute 
Pci) P(tj7G (2) P(1) 76 (1) Eom (2) 
zZ Compute TE Delay. Del ay 
C (1) P (1) .G (1) P (1) ,G (2) P (1) ,6 (2) 
2 Compute Compute Delay. a 
5 (4) C (i) p(dyretiy = P(A) OTH) 
4 Delay Comput Compute DeLay. 
S (1) S (1) C (1) P'(1) ,G (1) 
5 Delay Delay Compute Compute 
S (1) S (i) S (1) C (1) 
6 Dela; Dela Dela; Compute 
5 (i), S(i)- S(i)- $ (i) 


Tanout is reduced by a factor of four. 
would 


increase since 


The event cycle time 


reduction more than make up for the event count 


cycle time grows faster than linearly with 


fanout. The only drawback with this design lies in the cost 
Lt LO or 64-bit 


every 4-bit slice added, another event is reguired. 


of extending generate j3z-bit sums. POE 
Nous, ca 


64-bit add would reguire 12 events. 
3. Second Level CLA Logic 


Again the data is divided into 4-bit slices called 
blocks. 


BLOcKs, 


But rather than let the carries ripple through the 


two new primitive functions are introduced. They 
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are the block propagate, BP(1i), and block generate, 3BG(i), 
eM ete TOR. BP(i)=1 implies that a carry into DlockK (1) Suva 
be propagated through to (plock iia) BG (i)=1 implies that 
block (i) will generate a carry into block (it+1). For aaa 
block wkere bit(1) is the least significant bit, The BP and 

BG primitives are generated by equations 4.5 and 4.6 respec- 


tively, with the P(i)'s and G(i)'s computed as before. 


- BP iy = Poa P(syP (ay P (a) (eqn 4.5) 

BG (iy = G (4+ G (3) P (4) G (ay P (ay P (3) + © (1) P (4) P (3)? (2) (eqn 4.6) 
Next, / the block camry. sea, which represents the carry 
Ero bicet41) ante block (i+1), is computed using equation 


4.7 which represents the same lcgic as eguation 4.3 





i 1-] 
=e Secu eal (eqn ao 
k=0 pak +t 
Somtatr, after three events, the P(i)'S, G(i)'s, 


BP(i)'s, BG(i)'s, and B8C(i)'s have been generated. If the 
Same method of generating the final sum as used in zero 
level CLA were to be used, two additional events would be 
reguired. The first again applies the logic of equation 4.3 
to each 4-bit block to generate the carry into each bit. 
Here the Cin for block{i) is given by eG. ie The second 
cycle is used to generate the sum from the C(i)'s and 
F(i)'s. One of these events can be eliminated if, while the 
BC(i)'s and their predecessors are being computed, an esti- 
mated sum of the 4-bit block is also computed. One method 
is to compute two estimated sums for each block, one 
assuming an carry into the block of 0 and the other assuming 
anGdiinny manecor © |. When the correct carry in for block (ijime 
generated, it is used to multiplex the correct sum for the 


block to the output. This assumed carry method was rejected 
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because of the large amount of area consumed by the regis- 
ters needed to hold two possible answers. The second nethod 
is to compute the estimated sum of the block assuming a 
carcry-in of 0 and then correcting the estimated sum once the 
actual carry-in to each biock is known. 

Since the estimated sum, ES({i), is not needed until 
after the third event and computing it as one event again 
leads to fanout probiems, the ccmputation of ES(4), the most. 
Sagniticant bit, through ES{1) is computed in two events as 
follows. First, an intermediate estimated sun, IES({i), is 
computed using two-bit slices, each assuming a 0 carry in 
(see equations 4.8 through 4.11). At the same time, a carry 
Mmeom bait(Z2) into bit (3) (IC23) is computed using equation 
4.12 On the next event, ES (1) is computed from the IES (i)'s 


and IC23 using equaticns 4.13 through 4.16. 


IES) = Py (eqn 4.8) 

IES (2) = Pix) G aa (eqn 4.9) 

IES (s) = Pis) (eqn 4.10) 

JES ay = Puy Gs) (eqn 4. 11) 

1023 = Gi+ Gy) P (2) (eqn 4.12) 

ES yy = IES) (egn 4. 13) 

| ES (2) = IES (2) (eqn 4, 14) 

| ES (3) = IC 23G@)IES 5 (eqn 4.15) 
| 

ES = [IES (x IC 23]@ IES (4 (eqn 4. 16) 
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Now, after three events, eStimated sums for each 
YW-bit block and the actual carry into each block (Cinb) are 
available. From these the sum can be computed using equa- 
tions 4.17 thromgh 4.20 s- 


S (1) = Cina GES 1) (eqn Yu. 125 
Sin= [Cin ES |@ES ey ‘eqn Gogee 
Ss) = [Ci ES ES 2] OES (egn 4. 19) 


54) = [Cun ES ES (ES s)|© ES co lace 
Using second level Gin logic, the 16-bit sum is 
generated in only four events. Additionally, this design 
can easily be extended to the generation of 64-bit sums. 
The logic of equations 4.5 and 4.6 which produced the second 
level primitives BP and BG can be used again to generate 
third level primitives, B3P ard 33G. These third level 
primitives represent the carry propagate and carry generate 
properties of 16-bit slices. The carry into each 16—iam 
block 1S provided by implementing equation 4.7. Thus 
adding one event will provide the carry into each of four 
16-bit blocks of a 64-bit sum. The logic of equation 4.3 is 
then used to generate the carry into each 4-bit block of the 
Sum and the final sum is computed as before. The rinal 
result is that by adding two events, for a total of six, and 
using the same logic as before (i.e. no new circuits need to 
be designed), the 10-bit adder can be extended to a 64-bit 


adder. 
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Bem OESIGN FOR TESTA BILITY 


Another primary objective of the adder design was to 
paoyuae LOr testability, that is, the ability to logically 
detect fabrication errors or circuit malfunctions rather 
ian visually searching for faults with a microscope. 

As the complexity of integrated circuits has grown, the 
Beelaity to logically detect fauits using only the normally 
available inputs and outputs has decreased markedly. As 
complexity increases, the numkter of likely faults to be 
tested for and the number of input vectors reguired to 
isolate a specific fault grow rapidly. Unless a design 
technigue is used which allows the tester to examine the 
Mmitericr logic of a chip , the order of magnitude of the 
number of input vectcrs required to perform useful logical 
testing is prohibitive. Thus 1f Logicale testability is 
desired, a design technigue that provides for it must be 
used. 

One such design technique is level sensitive scan design 
miosD) [Ref. 13]. Level sensitive implies that the output 
of any logic element is dependent only on the levels of its 
mputs. No logic elements are allowed to depend on a tran- 
Sition such as in an edge triggered flip flop. Scan design 
implies that all memory elements in the design are to have 
an auxiliary function where their contents are serially fed 
to an output pad for examination. This gives a tester the 
ability to examine intermediate results. In applying the 
LSSD technique to the adder design, the following steps were 
taken. 

First, all circuits were designed to respond to the 
level of their inputs and not to require a transition to 
trigger their operation. Second, to insure that each logic 
event worked only with stable, non-fluctuating input levels, 


the inputs to each event were gated. The input gates were 


re 


opened only after tne inputs were stable (i.e. the outputs 
of the previous event were stable) and closed before the 
input gates of the previous event were opened. Thi ode 
dual mode latch was used to stecre the output of each logic 
event. In toe normal node ci operation, the register 
latches the outputs of one legic event in parallel and 
stores them to be used as inputs for the next logic event. 
In its secondary mode of operatioa, the register stops 
taking its parallel inputs and starts to run asa shift 
register, shifting its contents onto an output pad. 

One of the conseguences of using the LSSD technique is 
the large amount of area consumed by the dual mode regis- 
ters. In high speed operation, an inverter pair would be 
sufficient to store inter-event results. But to permit low 
speed testing where the capacitance of a gate may discharge 
during one clock phase, and provide the dual mode feature, a 


pair of clocked latches with control circuits is required. 


C. LAYOUT DESIGN 


With the logic decided upon, the next step was to create 
the layout of the adder. The lcgic consisted of four events 
to produce the sun. Another event was needed to latch the 
input data onto the chip. A two-phase clock was needed to 
losure that two adjacent events did not run simultaneously 
(insuring stable inputs to each event). To make the output 
of the adder compatible with the input to another adder, a 
one event delay was added. This insures that the output of 
one adder does not change while a second adder is using the 
sum from tne first as an input. With two 16-bit addend 
inputs, one carry-in input, one power supply (Vdd) ~a2npuw 
one reference (GND) input, a 16-bit sum output, one carry- 
out output, and two clock inputs, ten pads were left froma 
standard 64-pin chip for register mode control input and 


register (shift mode) output. Since the design called for 
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five registers, one for each logic event and one (for 
meche@ng the 1Nput data, five pads were used for input of 
the register mode control signals and five were used for the 
registers to serially output their contents. With the 
required inputs and output identified, the preliminary floor 


plan shown in Figure 4.2 was created. 
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Pigure 4.2 Preliminary Chip Floorplan. 
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The first circuit designed was the dual mode latcima 
Figure 4.3 Here the circuit is designed to latch the IN 
level when Control is low (Control is high) and phil Ys)hage@ 





Pe 


. “i 





Figure 4.3 Dual Mode Latch. 


(phil is low). When phil goes low, a copy of the input is 
also stored in the second latch and becomes available at 
shift-out which is ccnnected to shirtt-in of the next Dave 
When control goes high, the IN signal is blocxed and the 
latch takes its input from the register to  thesteum The 
shift-in of the leftmost latch in a register is tied to 
Ging tnd Versatec plots of the actual layouts of this dual 
mode latch and the other circuits described in this section 
are given in Appendix E. 

The AND gate used was corstructed from a NAND gate 
followed by an inverter as shown in Figure 4.4 Similarly, 


the OR gate was constructed frem a NOR gate followed by an 
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inverter (see Figure 4.5). Although logic implemented using 
these AND and OR gates iS more area consuming than the sane 
logic inplemented in NAND and NCR gates only, the penalty is 


not severe because they were used infrecuently in the final 


design. 
: | 
A 
? 
| oF 
| 
Figure 4.4 AND Gate. 
<< nga PE eae 
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Figure 4.5 OR Gate. 





The exclusive OR gate (XOR) Was GCONStructed from two 


inverters and three NAND gates as shown in Figure 4.6. 


Sed 


Though this design 1s consideratly more area consuming than 
the XGR gate of Figure 3.1, it was selected because the RNL 


circuit simulator could correctly model its operation. 





Figure 4.6 Exclusive OR Gate. 


More complex logic functicns were implemented using 
programmed logic arrays (PLA) where the outputs are the 
jogical sum (OR) of the products (AND) or aqupuece A single 
phase design was needed. A FLA designed to compute when 
phil is high, between the time the preceding event kad 
produced stable outputs (phi2 gcing low) and the time phil 
goes low, had to produce tne precper sum-of-products resuiae 
To hold down fanout, a dynamic structure was needed so that 
inputs could be applied to a sirgle type of transistor. Te 
prevent steady state power consumption a precharged dynamic 
Structure was needed. Because of charge sharing, the prec- 
harging must take place while the inputs are present on the 
transistor gates of the PLA (see chapter 5, section C, for a 
complete explanation of the charge sharing problem in this 
PLA structure). Thus, two distinct events must occur during 
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this time period. Pest cena Outs NMUSt be applied and 
precharging must take place. Then evaluation must occur. 

To cause these two events to occur during a single phase of 
the clock, the inter-phase time when both phil and phi2 are 
Jow must be utilized for precharging. The basic structure 


of the resulting PLA is shown in Figure 4.7 


sonal! inz an 
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Figure 4.7 PLA Structure. 


MetecEbing» back to the rlocrplan in Figure 4.2, the 
layout of the circuits which perform the logic of each event 
are presented in Apfendix E. The names assigned to the 
dayouts are given below. Event 1 consists of a 33-bit dual- 
mode latch. Event 2, which corputes the P and G primitives 
for each bit,is made up of 16 AND gates, 16 XOR gates, and 
another 33-bit latch. Event 3, which computes the BP and BG 
BEimatives, The IES ({i)'s and the IC23 for each 4-bit block, 
is made up of four instances cf PLA82 and a 29-bit latch. 
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The circuit PLA82 1s made Up of an S=2npue 5- producw 
2-output PLA , two XOR gates, ore AND gate, and one OR gate. 
Event 4, which computes the ES(i)‘'s and BC for each 4-bit 
block uses four instances of fFLA84 to compute the ES(i)'s 
and one instance of PLA915 to compute the BC(i)'s and a 
21 Date) late he The circuit PLAI15 is a 9-input, 15—-produem 
S-output PLA and the circuit PLA84 is an 8-input, 7—-produem 
4-output PLA. Event 5 uses four instances of PLA104 to 
compute the S(i)'s anda 17 bit latch to store results and 
provide the added delay (by taking the output from the shift 
out position, the extra clock cycle of delay is generated). 
The circuit PLA10% isa 10-input, 14-product, 4-outpuceeaee 
With this design, the input to output latency is three full 
cycles of a two-pnase non-overlapping clock; three cycles of 
the clock elapse between the time the addends are presented 
to the chip and the time the sum becomes available at the 
output. In the first three registers the odd number of pits 
is due to the need to store the carry-in value until event 
4. In the last two registers the odd number of bits is due 
to the need to store the computed value of carry-out. 

The resulting rinal layout of Figure 4.8 shows’ the 
actual on-chip layout tocations of each event's logic: In 
additicn to the logic circuits for each event, the circiaeee 
AMP and AMP5 are also Seen. These are driver circuits for 
the high fanout control and cleekysignale. Each takes as 
its input a control signal and produces as outputs the 
control signal and its inverse, both driven by 3-mi1c¢roume 
160-micron transistors. This amplifier is the same design 
used by the output pads to drive off chip loads. 

This final layout represents one implementation of a 
pipelined CLA adder designed for testability. The relative 
merits of this design and others that may have been imple- 
mented can, as yet, only be gualitatively discussed. The 
addition of SPICE 2G7 to the CAD toolbag will provide zrutuge 
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Figure 4.8 Final Layout. 


CMOS designers with the quantitative analysis necessary to 


make decisions involving tradeoffs among primary design 
objectives. | 
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This final design, when simuiated using RNL, functioned 
properly at clock speeds up to 14 meyahertz. Testing of tne 
actual chips produced by MOSIS should give an indication jee 
the accuracy Of RNL*s predict ienms. The following chapter 
presents a test plan to check for proper operation of the 
adder at low clock rates and tc determine the maximum oper- 


ating speed. 
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¥. TEST PLAN 

After several iterations of the design-simulate-redesign 
loop, a final layout was achieved for the 16-bit pipelined 
adder. These iterations provide considerable confidence in 
the lcgical correctness of the layout. Appendix D contains 
RNL simulation results for the full adder. In reading these 
results it should be kept in tind that the adder reguires 
three cycles of the two-phase clock to produce the sum. In 
the first part of the simulation, the inputs were kept 
constant for three clock cycles to facilitate easier reading 
of the results. With these steady inputs, simulations were 
run to verify the generation of correct sums, concentrating 
on those addends that would produce carry propagates and 
carry generates across the boundaries of the 4-bit blocks. 
The last part of the simulaticn utilized different inputs 
each clock cycle. This was done to test the pipelining 
feature of the design, insuring no dependence on repeated 
inputs of the addends to produce the proper sun. 

After fabrication of the chip, application of similar 
inputs to make the same determinations for the actual 
Smecuits will form the initial fortion of the test plan. In 
this chapter a test flan for the verification of computa- 


tional ccrrectness and speed will be presented. 


A. INPUTS AND OUTPUTS 


The first step in testing the chip will be to connect it 
Pomene Cegquired input and output circuitry. To accomplish 
this, the identity of the inputs and outputs on each pin 
must be determined. Microscopic examination of the chip 
Will reveal the logo "16-bit Add", located between the GND 


and Vdd buses for the pads inthe northeast corner (see 
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Figure 4.8 which is repeated below for convenience). Using 


this landmark, the signals on the pads can be labeled as 


follows. 
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Figure 4.8 (repeated) Final Layout 
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The western edge has sixteen input pads for the addend 
we wien the @least signrtricant bit, A{(1), located at the 
northern end. The northern edge of the chip also has 
Sixteen input pads for the addend 3B, with the least signifi- 
cant bit, B{1), located at the eastern end. The southern 
edge has fourteen output pads and two input pads. At its 
western end is the GND input pad followed by fourteen output 
ads "EOE S(16), the most Significant bit of the sun, through 
Si2) . Following S(3), at the eastern end is the input pad 
for Vdd. The eastern edge of the chip has eight input pads 
and eight output pads. Starting at the northern end, there 
mmeemimput pads for phit, phi2, Cin, CON! (control signal for 
the dual mode register of event 1), CON2, CON3, CON4, and 
CONS. They are foilowed by output pads for SREG1 (serial 
Output from dual mode register of event 1), SREG2, SREG3, 
SREGY, SREGS, Cout, S{2), and $(1) at the southern end. 

To supply power to the chip, +5 volts DC should be 
applied to the Vdd pad and QO volts to the GND pad. All 
eed Inputs gncluding clocks and control signals should 
ke elther GND for a logical 0 or Vdd for a logical 1. 
Simulation with RNL revealed some restrictions on the clock 
Signals. For proper operation, each clock should remain 
high for a minimum of 20 nanoseconds and the clock inter- 
phase tine, when both phil and phi2 are low, must be at 
least 10 nanoseconds in duration. Romeloitial testing, to 
insure that charge sharing protlems caused by too short an 
interphase time, and fanout preblems caused by too short a 
Clock phase duration, are not interpreted as fabrication 
errors, tne clock speed should de adjusted so that both 
above clock parameters are exceeded by one order of 
Magnitude. 

The outputs, like the inputs, are at Vdd to represent a 
Logical 1 and at GND to represent a logical 0. The circuits 


used to measure the outputs should have a ee tive © 
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impedance, on the order of one megohn. The output pads of 
the adder are not designed to handle the current source and 
Sink requirements of transistor-transistor logic integrated 
Ci PEUAES. The output measurement circuits should be 
constructed using NMOS or CMOS devicesthat are designed to 


operate Letween +5 vcits DC and ground. 


Bo. ZESTING FOR CORRECT OP EnaAr. Ch 


After connecting the adder to a test harness, the next 
step is to verify the generation of correct sums by the 
adder. There are several inputs that should be included in 
the testing to verify the correct operation of individual 
CibCuits. These are contained in Appendix F. In addition 
to the test vectors of Appendix F, several randomly selected 
input vectors should Ee tested. If the adder should fail to 
generate correct sums, The LSSD features can be employed to 


examine intermediate results. 


1. Intermediate results 





With the LSSD design, a tester can leave input 
levels constant for a long pericd of time and use the shift 
mode of the internal registers to examine the internal state 
of the chip. The rightmost bit of each register is always 
available at the output pad for that register. To obtaum 
the contents of the other bits, the control signal for the 
given register is set to and held at logical 1 while the 
clock Continues tor rine For registers 1, 3, and 5 the 
serial output will be meaningful and stable while phi2 is 
high. The serial output of registers 2 and 4 will be stable 
when phil is hign. Table 3 lists in order the intermediate 
values available at the SREG(n) output pad when the input 


CONn is high. 
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TABLE 3 


Register Serial Outputs 


CloekK ; 
Cycle SREG1 SREG2 SREG3 SREGY SREGS 
0 B1 Pl BP 1 Cah Sa 
1 Be P2 TESS BeZ s8 
2 B3 P3 ITES4Y Gout SS 
5 BY PY BG2 ES2Z S7 
4 B5 P5 IES5 ESY S9 
5 B6 P6 TES6 ES6 Solel 
6 ae P7 TCov ES8 S13 
7 B8 P8 BP3 ES10 SoD 
8 BY PY IES11 ES12 0 

9 B10 P10 ITES12 ES14 Cout 
10 B11 P12 BG4Y ES16 $2 
1 B12 P12 Teor BC1 S4 
2 B13 P13 Bie os Bes S6 
iS B14 P14 meva15 ES1 S8 
14 B15 P15 BG1 ES3 S10 
iS B16 P16 ITES1 ES5 $12 
16 Al G1 eo ES7 Ss 
17 pee G2 rE7s ESI S 16 
18 A3 G3 BP2 ES11 0 
19 AY G4 ese Pot 0 
20 A5 G5 So ES i> 0 
21 A6 G6 BG3 0 0 
a ae G7 IESY 0 0 
23 A8 G8 mie Sag 0 0 
24 AY G9 men i 0 0 
25 A190 G10 BPY 0 0 
26 A G11 E515 0 0 
27 A12 G12 Teo 0 0 
28 A13 G13 Cin 0 0 
29 A14 G14 0 0 0 
30 A15 G15 0 0 0 
31 A176 G16 0 0 0 
2 GiD Can 0 0 0 
eo 0 0 0 0 0 
34 0 0 0 0 0 


C. TESTING FOR SPEED OF OPERATION 


Once the chips containing fabrication errors have been 
culled from the chip set returned by MOSIS, the task 
remaining is to determine just how fast the adder can run. 
Rather than simply increasing the clock rate until the adder 
fails, the duration of the time both phil and phi2 are high, 
and the interphase time should reduced separately. RNL 
Simulation indicates that the circuit which generates S4 
within PLA104 is the limiting circuit for clock phase dura- 


tion (i.e. it reguires the longest time to correctly 
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evaluate its anpues)r RNL simulation also indicates that 
the circuits in PLA104% which generate S$1iand S4 are the 
limiting circuits for the clock interphase dunatwoue 

Since the PLA 1s constructed of precharged dynamic 
circuits, the evaluation clock phase must be long enougiaae 
allow the inputs to drive the outputs to their proper 
values, even if the inputs are the same as those of the 
previous evaluation cycle. This allows the tester to use a 
constant input as the duraticn of each clock phase is 
reduced until the adder produces incorrect results. 

Determination of the clock interphase duration limit is 
more difficult. This 1s because the inputs to a PLA must be 


changing to cause charge sharing problems ) to occinm: Fou 





Figure 5.1 Charge Sharing in a PLA. 


example, in Figure 5.1 assume that the first set of inputs 


is anil=1, in2=0, and that this is correctly evaluated to 


6§ 


produce out=90 waen phil is hign. Now assume that the next 


input is ini=0 and in2=1, which should also evaluate to 


out=0. However, if the precharge time (when the inputs are 
present on the gates of Q2 and ¢3 and phil is still low) is 
insufficient, C2 will not be charged to Vdd when precharging 
ends (C2 


evaluation when ini was high and phil was high). 


was discharged to zero volts during the previous 

Now, when 
evaluation begins (phil going high) the low voltage across 
C2 causes 95 and Q6 to interpret their input as a logical 0. 
As a result the output of the Q5-Q6 inverter pair goes high, 
resulting in an 


Table 4 lists the 


Seausing OS to turn on, discharging C4 and 
mimeut OL logical 1, which is incorrect. 

proper evaluation sequence when precharge time is sufficient 
to insufficient precharge 


and the improper seguence due 


time. (iment smtaote, for) the anputs, output, and capacitor 
woltages a 1 indicates Vdd, 0 indicates GND, and X indicates 
For the 


aAnGgvan KX ind2eates neawther fully on 


somewhere in between. transistors, a 1 indicates 


emu, a0 indicates ofi, 


TABLE 4 


PLA Evaluation Seguences 


Proper evaluation sequence: 
hi e 


P JT ? lou’ 
Zee, (22348 234567 &90 
lec On OO Tl t 1000 10 C0 1770 
(eee SOO 1} 01071011001 0 
Vie ot SOO TT SC 101011001] 0 
OROS SOT OV OOTTOTICOT 0 
Weeeee ls Oldd TO10010C01 0 
Improrer evaluation sequence: 
gen ene oe 7 lineee Wee 
Zoe W202 34 2345676890 
ieee tO OO 100010001 0 
CO OOO 1d FOO 1G11CO1 0 
(eee ote OO 1 SOTO J011601 - 0 
OOO le Ox V1 VOOTTOTICO1. 0 
lemmol SOXXO TOTOXXOR1IO ~ 1 
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NOE UL yersorre. Subsequent inputs of intl=0 and in2=1 may 
produce correct resultS since with constant inputs, each 
precharge time will add more charge to C2 until there is 
sufficient charge to allow the output of the 05-06 iInvenees 
to remain low. 

Thus, to check for charge sharing problems in the 
circuit of Figure 5.1, the inputs must alternate. Likewise, 
in PLA104 to check for charge sharing errors ib OuUtpue yaa 
its inputs must alternate between ES1=0, BC=0 and ES1=1, 
BC=1 as the interphase time is reduced. This can be accon- 
plisied for all four instances of PLA104 simultaneously by 
alternating inputs of 

A = 0001 1001 1001 1001 

B = 0000 1000 1000 1000 

Cres= A 


and 
A 0000 0000 0000 0000 
B 0000 0000 0000 0000 
Cin = 0 


To check for charge sharing errcers in S4, the inputs to PLA 
104 must cycle between BC=1, S4=0, S3=S2=1,S1=0 and 
BC=0, S4=0,S3=S2=S1=1. This may be accomplished for alli four 


instances of PLA104 simultaneously by alternating inputs of 


A = 01710 1110 1110 1110 
B = 0000 1000 1000 1000 
Cin = 1 

and 
A = 0917190111°01 11200 
B = 0000 0000 0000 0000 
Cin = 0 


This maximum speed testing assumes that RNL has correctly 


identified the Slowest circuits on the Chime. RNL 
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Simulations have indicated that the next slowest circuit 
(PLA915) is at least 20% faster than PLA104 (16.0 nsec for 
PLA915 vs. 20.1 nsec for PLA1C4). Also, ail other PLA's 
functioned properly with a 5 nsec interpnase time. 

Should PLA104 prove to be the speed limiting circuit for 
the chip, the actual failure speeds of the chip can serve as 
an indication of the accuracy of the RNL simulation for 


future designs. 


aa 


VI. CONCLUSIONS 
The experience gained in the design of tne adder coupled 
with the clarity of hindsight leads to the following Concise 


sions and reconmendations. 


A. THE CH&OS TECHVOLGGIES 


The CMOS technologies will play asrole of steadily 
increasing importance in the VLSI designs of the future. 
MOSIS is already offering, on an experimental basis, CMOS 
Bulk p-well fabrication with a oone-micron minimum feature 
size. A scalable set of design rules, to allow initial 
fabrication in 3-micron CMOS fcr design verification Benoa 
the far more expensive i-micron process is used, is being 
develcped. 

In the private sector there is considerable research 
aimed at finding an insulating substrate material that does 
not nave the variability and thermal problems of sapphire. 
Progress in this area will remove the drawback caused by 
latchup tendencies in CMOS Bulk. 


Be. CHOS CAD TOOLS 


Though the design tccls currently available at NPS consti- 
tute a complete set for the design of CMOS Bulk p-well 
GIEGULES, the recent CAD _ tool set released by the 
University of Washington/Northwest VLSI Consortium, Release 
2-0 {Ref. 11], coupled with University of Calitornvageas 
Berkeley Winter 1983 CAD tools, represents a more complete 
and cohesive set for CMOS design. When sufficient disk 
space on the Yax 11-780 bececnes available to load the 


Release 2.0, implementation of the Release 2.0 CAD package 


TZ 


is highly recommended. An added benerit of instaliing the 
Release 2.0 package is the cell library provided. The 
library coatains several basic standard cells with known 
performance characteristics. The library also contains the 
standard pad frames used by MOSIS. Though MOSIS does not 
reguire the use of standard pad frames on designs submitted, 
their use does speed up fabrication. 

As mentioned earlier, as socn as SPICE 2G7 is available, 
its addition to the CAD toolkag would be most advantageous 
to a CMOS designer. 


ee, DESIGN OF THE ADDER 


If the design of the adder were to be undertaken again, 
a different approach to generating the sum would protably 
have been used, especially if the new CAD tools mentioned 
above were available. The logic approach to the computation 
would still involve CLA additicn, but 1t would be accon- 
plished using combinational logic and library cells rather 
mian PLA's. Testability would probably suffer greatly, but 
effort would be made to reduce the sum generation tec two 
logical events. Though the level of testability provided by 
the current design should provide considerable insight into 
CMOS Bulk p-well performance and CAD tool accuracy, there 


would Fe no need to repeat the investigation. 
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SPICE MODEL CARDS FOR 3-BICRON CHOS=PNSDEV Gere 


CMU models for MOSIS 3-micron CMOS Bulk p-weil devices: 


Fast Models 
emodel n nmos vto=0.4 tox=0.7e-7 lambda=ie-7 Il1d=1e-6 


#xj=1.1e-6 gamma=.3 uo=500 cbd=5e-4 cbhs=5e-4 


-model p pmos vto=-.4 tox=0.7e-7 iambda=1te-7 Il1d=1e-6 
+xj=1.1e-6 gamma=.3 uo=300 cbd=3.5e-4 cbs=3.5e-4 


Siow Models 


emodel n nmos vto=1.0 tox=0.8e-7 lambda=i1e-7 Ild=.5e-6 
+xj=0.6e-6 gamma=1.3 uo=400 chd=6e-4 cbs=6e-4 

-nmodel p pmos vto=-1.0 tox=0.8e-7 lambda=le-7 l1d=.5¢-6 
+xj=0.6e-6 gamma=.9 uo=200 cbd=4.1e-4 cbs=4.i1e-4 


MIT Models for MOSIS 3-micron CMOS Bulk p-well devices: 


Slows Suow 
-moagel nss nmos level=2 rsh=20 tox=650e-10 Il1d=.25e-6 
+xj=.35e-6 cj=6e-4 cjsw=4e-10 wo=475 vto=1.2 
+cgso=1.3e-10 cgdo=i.3e-10 nsub=1.5e16 
+vnax=5e4 pb=.7 nj=.5 njsw=.5 


+neff=2.5 ucrit=8e4 uexp=. 25 


-model pss pmos level=2 rsh=8&0 tox=650e-10 Ild=.25e-6 
+xj=.35e-6 cj=4.1e-4 cjsw=2.5€-10 uo=190 vto=-1.2 
+cgso=1.3e-10 cgdo=1.3e-10 nsub=5e15 tpg=-1 
+yvmax=5e4 pb=.7 mj=.5 njsw=.5 


+nerf=2.5 ucrit=8e4 uexp=. 15 
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East D-type Yalow n-reype 


-model nfs nmos level=2 rsh=30 tox=600e-10 Il1d=.25e-6 
¢xj=.35e-6 cj=6.0e-4 cjsw=4.0e-10 uo=475 vto=t.2 
+cgso=1.9e-10 cgdo=1.9e-10 nsub=1.5e16 

+vmax=5e4 pb=.7 nj=.5 mjsw=.5 


+neff=2.5 ucrit=8ed uexp=. 25 


emodel pfs pmos level=2 rsh=20 tox=600e-10 l1d=.40e-6 
+xj=.60e-6 cj=2.0e-4 cjsw=1.0€-10 uo=270 vto=-0.6 
+cgso=2.0e-10 cgdo=2.0e-10 nsub=0.3e15 tpg=- 1 
+vmnax=5e4 pb=.7 mj=.5 Djsw=.5 


+neff=2.0 ucrit=8e4 uexp=. 15 


Fast p-type Fast n-type 


-model nff nmos level=2 rsh=10 tox=550e-10 l1d=.40¢e-6 
+xj=.60e-6 cj=3.0e-4 cjsw=2.06€-10 uo=675 vto=0.6 
+cgso=2.5e-10 cgdo=2.5e-10 nsub=0.5e16 

+ymax=5e4 pb=.7 nj=.5 njsw=.5 


+neff=2.5 ucrit=8e4 uex p=. 25 


-model pff pmos level=2 rsh=20 tox=550e-10 l1d=.40e-6 
+xj=-60e-6 cj=2.0e-4 cjsw=1.0€-10 uo=270 vto=-0.6 
+cgso=2.5e-10 cgdo=2.5e-10 nsub=0.3e15 tpg=-1 
+vmax=5e4 pb=.7 mj=.5 mnjsw=.5 

+tneff=2.0  ucrit=8e4 uexp=. 15 


uD 


Slow p-type Fast f-t7 2e 


-model nsf nnos level=2 rsh=10 tox=600e-19 Ild=.40e-6 
+xj=.60e-6 cj=3.02e-4 cjsw=2.06€-10 uo=675 vto=0.6 
+cgso=2.0e-10 cgdo=2.0e-10 nsub=0.5e16 

+vmnax=5e4 pb=.7 nj=.5 hjsw=.5 


+neff=2.5 ucrit=8e4 uexp=. 25 


-model psf pmos level=2 rsh=&0 tox=600e-10 Ild=..25-6 
+xj=.35e-6 cj=4.1le-4 cjsw=2.5€-10 wuo=190 vto=-1.2 
+cgso=1.2e-10 cgdo=1.2e-10 nsub=5.0e15 tpg ——al 
+ynax=5e4 po=.7 mj=.5 mjsw=.5 


+nerif=2.0 ucrit=8e4 uex p=. 15 


716 


RULEC (CAD) 


NAGE 


APPENDEX B 
UNIX MANUAL ENTEY FOR RULEC 


CAD Toolbox User's Manual RULEC ( CAD ) 


rulec — Compile design rules for Lyra 


SYNOPSIS 


rulec [lo] rules 


DESCRIPTION 


feulec is a shell script with the following processing steps: 


) 
i) 


i) 


iv) 


_ The actual Lyra rule compiler is invoked to translate the symbolic rule 


description, rules.r, to lisp code, rules.L 
The lisp compiler, Liszt, is invoked to compile ruwles.1 to rules.o 


rules.o is loaded into Lyra.proto to generate an executable lisp Lyra, 
rules, 


The intermediate files rules. and rules.oare deleted. 


The following options are supported: 


-1 (load only) No compilation is done. Previously compiled rules, rules.o, 
are loaded into Lyra.proto to generate an executable Lyra, mes. This 
option is useful mainly at Berkeley, where Lyra.proto changes frequently. 

—o (save object) Name.o is not removed. Enables Tulec 4 rules’ in the 
FOtone. 

FILES 

~cad/bin/rulec — rulec shell script. 

~cad/lib/lyra/Rulec1 — lisp rule compiler 

~cad/lib/lyra/Lyra.proto — Lyra sans compiled rules code. 

~cad/lib/lyra/*.r -- standard rulesets. 

~cad/lib/lyra/DEFAULTS -- gives default rulesets for Caesar technclogies. 

sil ALSO a 

Lyra (CAD) 

Maiszt( 1) 

AUTHOR 


Michael Arnold. 
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PRESIM USER'S GUIDE 


Config file: used to calibrate ENL 


capm2za 
capm2p 
capma 
capmp 
cappa 
cappp 
capda 
capdp 
cappda 
cappdp 
capga 


lambda 


- 00000 


- 00000 


-00006 


- 00000 


-00006 


- 00000 


200010 


- 00060 


-00010 


- 00060 


-00057 


1.0 


vs 


PRESIM User’s Guide 


UWINW VLSI Consortium 


Department of Computer Science 
University of Washington 
Seattle, WA 96195 


(This document is based on portions of the document "User's Guide to NET, PRESIM and 
RNLNL,” by Christopher J. Terman, Laboratory for Computer Science, MI.T.. Cambridge, MA 
02139.) 


One must first convert the sim file to a network file suitable for use by RNL or NL — to do this 
we run PRESIM: 


presim foo sim foo [config] options... 


which converts the file foo sim into a binary file for RNL/NL called foo. 


The -g option: 


Suppresses the sum-of-products formation. This may be desired if you think 
sum-of-products is formed wrong otherwise the advantages of the transstor and 
node reduction make this option unattractive. 


The -¢ option: 

-cfile minvalue 
writes a list of node names and capacitances to the specified file. Only capacitances larger than min- 
value will be included. 
The -t option: 

-tfile minvalue 
writes a list of transistors and RC values to the specified file — there are two entries for cach transis- 
tor. The R's come from the size of the transistor, C’s from the source/drain capacitance. Only RC 
values larger than minvalue will be included. 
The -p option: 

“presist voltage 


provides a worse-case estimate of the circuit power consumption by assuming thar all the pullups 
(DEP or LOWP devices with drain=VDD) are all on simultaneously. "Voltage’ specifies the supply 


UW/NW VLSI Release 2 -l1- 1¥ 1/83 


aS 


en ee ae a 9 me pec ee Ometrtes cere, Qe EGE eh Gals oe ee OT SY 


UW/NW VLSI Consortium PRESIM User’s Guide 


voltage, for example *-p5” specifies a VDD of 5 volts. The result is printed after PRESIM completes its 
other processing. When figuring the resistance of a puilup device the “power” characteristic resistance 
as set in the config file is used. 


The optional third file (config) specifies various electrical parameters. The internal values (the 
defaults) are a generic ser. They do not reflect any particular fabricarion process. (UW-NW VLSI 
NOTE: A configuration file is provided in the source code rhat duplicates the internal settings as an 
example of how this file could be used. [In addition we note that, the resistor values are stored first 
sorted by width, then by length not by rhe ratio. Values act explicitly provided in the configuration 
file are estimated by linear interpolation.) The formar of this file is lines of the form 


parameter value comments... 


Lines beginning with °;” are treared as all comment. The parameter names and their default values 
are: 


; configuration file for “standard” MPC process 


capm2a 00000 ; 2nd metal capacitance ~ area, pf/sq-micron 
capm2p 00000 ; 2nd metal capacitance ~ perimeter, pf/micron 
capma 00003 ,; lst meral capacitance ~ area, pt/sq-micron 
capmp 0000 ; Ist metal capacitance -- perimeter, pf/micron 
cappa 00004 ; poly capacitance — area, pf/sq-micron 

cappp 0000 ; poly capacitance - permeter, pf/micron 

capda 0010 ; a-diffusion capacitance ~ area, pt/sq-micron 
capdp 00060 , a-diffusion capacitance ~ perimeter, pt/micron 
cappda 00010 ; pdiffusion capacitance — area, pf/sq-micron 
cappdp 00060 .; pdiffusion capacitance — perimerer, pf/micron 
capga 0040 ; gate capacitance ~ area, pf/sq-micron 


lambda 2.5 , microns/lambda (conversion from sim fle units 
; to units used in cap parameters) 


lowthresh 03 ; logic low threshold as a normalized voltage 
highthresh O8 ; logic high threshold as a normalized voitage 


cntpullup 0 ; <>0 means that the capacitor formed by gate of 
, pullup should be included in capacitance of ourput 
; node 

diffperim 0 ; <>0 means do not include diffusion perimeters 


; that border on transistor gates when tguring 
; ndewall capacitance (°) 


subparea 0 ; <>0 means that poly over transistor region will aot 
; be counted as part of the poly-bulk capacitor (°) 
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diffext 0 : diffusion extension for each transistor, ic. cach 
; transistor is assumed to have a rectangular source 
; and drain diffusion extending diffext units wide and 
; transistor-width units high. The effect of the 
; diffusion extension is to add some capacitance to 
;the source and drain node of each transistor — 
; useful when processing the output of NET to improve 
; the capacitive loading approximalions without adding 
; explicit load capacitors. diffext t specified in 
; lambda (it will be converted using the lambda factor 
; above). 


resistance channel context width length resist 

; this command specifies the equivalent resstance for a transistor 
; of type channel with the specified width and length. Transistors 
; matching this entry will have the specified resistance; linear 

; interpolation is done if the width and/or length is not matched 

; exactly. 

; channel is one of ’enh’, "dep’, “intrinsic”, low-power’, 

‘ *pullup’, or “p-chan’ 

context is one of “statie’, "dynamic-high’, “dynamic-low’, or “power” 
width is given in lambda 

length is given in lambda 

resist is given in ohms 


ee 89 wt @et 


(*) These paramters should be 1 only when processing the output of 
the node extractor. They cause various corrections to be made 
to the interconnect component of a node's capacitance — usually 
only extracted sim files have information regarding interconnect 
capacitance. 


PRESIM uses these parameters in calculating the capacitance for each electrical node and the resis- 
tance for each transistor channel. 
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APPENDIX D 
ADDER SIMULATION 


The following two listings are: (1) the KNL command file 
for the entitle Chromeo nom the results of running that 
command file. In addition to this overall testing, all the 
layout of Appendix G were simulated individually. A Dee 
Teature of RNL is the indication of when a watched node 
changes state. Thus, by making all the outputs of a circuit 
watcned nodes, RNL will provide the minimum time duration 
for a clock cycle to produce the outputs (the longest time 
indicated by the simulation). This can be confirmed by 
running the simulation with a faster clock, resulting in 
outputs of X (neither 1 nor 0) where insufficient time has 
rFeen allowed. 

RNL simulation to determine the minimum time for preGe 
harging the PLA circuits is only slightly more involved. 
For each product term in the FLA, alternating inputs are 
selected that will result in maximum amount of N+ diffusion 
needing to be charged from 0 vclts to Vdd. Then as these 
inputs are alternated, the PIA precharge time is reduce 
untit the circuit fails to produce correct results. For the 
PLA's in the adder, visual inspection for the product tern 
with the iongest precharge requirement was done by looking 
for the longest Nt diffusion line which must be charged 
through the maximum number of transistors. The visual 


inspection results were confirmed by RNL simulations. 


SZ 


Bet 22 


(looefile 
(loac 
(load 
(reageretrork 
(sete noaces 


erreecout rhii 


Centiadq nodes) 


1 aj 22 63 a4 
fee 62 DS HS 
feGanl cone 
erm £hil 

(defvec 


foe 2d 1 oes 


"(al 62 a2 aa 
Paewais €@15 cl ©&2 23 44 
Pi6é si s2 s3 s4 Ss& 


earns 
er.i2 
Serireetceke ehl} chi2)) 


Sebi boa") 
BilwS&Ce i’ ) 
Pies ite l” ) 


ae 


st s7 sb 8S 


65 at ai 


erinr,cre Pece 3 


Srur si) S42 


af a9 a1lC ail aj12 a13 
nS be ey were ns BiG E11 


El? 613s 614 Bis 
s135. S14 si5 sic 


Gris cor) ecenzg cons eon4 eens )) 


ab a7 ac 
P6 EF? Pe 
COlls Cons 


as 
BS 


Bo 210 Vert 


ac alu ali ai12 ai13 aie aiS ai6 
Sl? esa el 4h oe Le 


(defvec °(rEin aaca ais alS ai4 ali3 ai2 all 610 
ay at a7 86 a5 a4 a3 62 al)) 
(deftvec “(bir ofhb ny6 b1S ete 613 12 bid cL 
Dye pseo? oe £5 64 8S E2 bE)) 
Gaetvec “(rir sur cout $16 siS sl14 $13 siZ $1} sj10 
Soest svestase S453 s2-si1)) 
Meerereport “("state is now" (vee eloecKks) cin cout neeline 
(vec ¢8a4) newline 
Cvec SEtCE) newline 
(vee sur))) 
(defur ss(Caurry) 
ster iner)) 
(defun ¢ecveles (a) 
(repeat {1 1 a 
coerce gner, ro) 
Css °(x)) 
wseeto iner 25 ) 
Ghee Gon) ) ) 
as? - '€X.))) 
Geete mer 10) 
lee (OM) 4) ) 
Css "(x )) 
Gseta ancr 25) 
Gne* (eri?) ) 
Gs CX) ) 
Gir Cops.) 
) 
) 
Peveles 5) 
(invee °(Caaaa vn6O60901111090001141)) 
emevec (bEbE Ob1111090%L11100C090)) 
Ceyeles 3) 
mrmvec “CEebrt CocecococivdvnvouL)) 
Seveles 3) 
Poein 
(cycles 3) 
leon 
Ormvece “(aa@ae 0F11311111121111111)) 
Mmemeec “(BtbrtE OoCOCClOCGCOULUCGGE)) 
(cyeles 3) 
Necin 
(cycles 3) 


83 


Cet (29s. SS 


Pein 

Cinwvece -CEPEE 
(eyeles 3) 
(invec “°(pbrbre 
(cycles 1) 
(invec °’(aaaa 
(cycles 1} 
Cinvec .CCPorE 
(cycles 1) 
(invec “°lasae 
(cycles 3) 
Cinvec.“Cebee 
(cveles 1) 
Racer 

Cévcles 3) 
(invec °(aaré 
Cinviee “Ceene 
(cycles 1) 
Fein 

(cycles <¢) 


Jor, CL Te. eCre Fece, 2 


OoCQNCOGCCGHO0OV0G1)) 
Oc oUNGodendnacdgaogaeg) ) 
GhHCGULUIDNECGNNCCGG00) ) 
Celt py yl 
CROEOQECTECNVIGUBUFD ) ) 


OS L121 01I 12 tila 


GCHUOCHNNGHOYRACNOON)) 
Or cagoescvuroeaagorc ) ) 
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Loédina uwsir,.1 
Dore loadino uwsijin.,l 
¢ 3086 nodes, transistors: enn=1494 intrinsic=s60 peenan=1141 dep=0 loweoower=0 pullup=s0 resistc 


Ster bpecins € OC ns, 
phi?=s=C #8 0 


peniisc @# 0 
cinsx=C @ 0 
cors=s= 2 0 
con4s0 2 Q 
cong=0 en 
con?st a 0 
eoni=0 F 0 
r16s30 @ 0 
b15=s0 @ 0 
pi4=0 @ QO 
big=C @ 0 
bp12=0 #8 0 
h11=<=0 @ 0 
bpin=sCt e@ Q 
ngs ae 0 
pR=sr a QO 
b7=0 @ 9 
n6=N a 0 
p5=0 a 0 
rgéanr 60 
b3sN &€ 0 
h2=sn9 @ Q 
risn é€ 0 
ai6=0 8 0Q 
ai5=0 @ 0 
aj4=0 a Q 
a1l3=0 @ 0 
ai2=c e 0 
alisO 4 ¢ 
al0=sG a C 
ag9=C e 0 
asgs=0 e 0 
a7sc a 6 
a4Q=C @ Q 
a5=0 e@ 9 
a4=0 4 90 
a3zs0 e@ 0 
a2=0 a 0 
aisC a Q 


ster becins @ 10 ns, 
phiisi @ 0 


Step hecins @ 35 ns, 
Bmris0°a@ 0 


Stec becins & 46 ns, 
pni2=1 2a 9 

s$i6=0 @® 14.2 

s°=0 a 16,4 


Ss 


Ft 


ee -6° 15273 1986  eEniewloo ace 2 


$ii=0 185135, 4 

$1350 @ 16,4 

s1S=00 @ 516.4 

s7s0 e&@ 16,4 

$520 @ 16,4 

$329 @ 16,4 

$14=0 2) 16,5 

si12=0 @ 16,5 

si0s0 # 16,85 

sRs0 a 16,§ 

sf20 @ t&,5 

$420 @ 16,5 

s2=(C 6 16,7 

$1=0 4 20 

stete is now: 

Current time= 70 
Clocks=0b01 cin=0 cout=Xx 
aaaa=sQroncdnangennnencod 
eebr=06b00C09000000000000 
sum=ryooq0qqona00onoa0ono0no 


Sten beains @ 70 ns, 
pA{2=n @¢ 0 


Step becins @ RO NS, 
enijsi € O 


Sterv beains & 105 ns, 
eniisa e@ Q 


Step bpecins ® 115 ns, 
phi2si e& 0 

Cnurt=One- 72,9 

state is now; 

Current timrme= 140 
Clocks=Mb01 ecin=0 cout=9 
seaaa=Qpodocdoqaonconcoqnod 
bohr=edrdngrgocgonoononod 
sum=050000090N000000000 


Sten beains @ 149 As, 
chi2S0 Ff © 


Step recins @ 15¢C ns, 
phlil=i»“, 0 


Stes becins ©€ 175 ns, 
phii=0 8 0 


Ster begins 6 185 ns, 
pALI2=1 a Q 

state is now; 

Current times 210 
Clocks=O0F01 cin=C couten 
aaaa=O0rocanogcodceraconad 
dbPoop=Orcgoococaqdcaccoa0nn 
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mee € 18:23 1984 enie.los Face 3 


sum=0b00000000000000000 


Step reains «© 210 ns, 
pnhi2=0 @& 0 


Srer tecins ¢ 220 ns, 
ophiisi s a 


Ster vecins @ 248 ns, 
Soe @ 


Seep necins € 255 ns, 
ohi2=1 e& 4 

state is now: 

Current titres 290 
Clocks=0b01 cin=9 coutsn 
aaaa=0nonnooconadononnno 
bhhesobonrooncooononrgng 
sum=Or000000000C00C0009C 


Ster keains #& 28C ns, 
onhi2=0 € Q 


Ster reoins & 280 ns, 
phniisi ® O 


Ster becins @ 3158 ns, 
Sms O @ 0 


Ster reoins # 325 ns, 
Bima7=1 ef 0 

stete is now: 

Current time= 350 
elroecks=9b01 cinsC couts0 
e@asea=NE000000I00Th00900 
breh=soRrcocNOoOorOoCceOorOoton 
sum=06000609000900000000 


Step neains € 350 ns, 

pi6si ea 0 

b1i5=3 @ C 

bid=1 Pf 0 
bissi @ 0 
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Steer beoins & 350 ns, 
onAfizs} 2 06 


Sten beqins © 385 nas, 
engiisod @ 0 


Step treqins « 395 ns, 
phi2?s1 a 0 

state is now: 

Current times 426 
cleeks=0b0!1 e€in=0 coutso 
Baaa=Npngndi11i0N0nor1iid 
eheh=0b1111000011110N00 
sumr=OQpndd0ndgocroon000000 


Step hecins «© 270 fs. 
pPAL7=0 4 QO 


Ster reaqins ¢ 43C ns, 
priiszi e289 


Stet Deans @© 455 ns. 
chiis0 @ 06 


S@é@c Becins @ 465 ns, 
opAi2=s1 a4 0 

state {is now: 

Current times 490 
Clocks=0Or01 cin=0 cout2=9 
aaaa=0r0d00111100N01111 
Bere=GeE111100C011 110000 
sum=orogoocgoced00000n0n0 


Steer beains 2 49C as, 
phi2sec @ QO 


Sten becins e@ SN as, 
Shid=s{ e270 


Stee beGins xe 575 ns, 
phiisd e Oo 


Step Peocins @ 535 as, 
eni7=i = 4 
Silosi a 14,6 
s9=1 @ 16,7 
Sii=j] 8 14,7 
Sis=s) @ 16,7 
$15=1 ©¢. 16.7 
s7=i SL t6,7 
sS5=i1 @ 16,7 
$3=1.60.16,7 
si4si ¢ 16,8 
Si2=1 e He. 
sio0si @ 16.,€ 


8 8 
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s8=1 @ 16, 
s6=1 2 jJo0.8 
s4si ? 16,8 
s2=1! @ 17 

siti @# 19,1 


state is nows: 

Current times 560 
€loeks=Ob01i1 ecin=0 coutsod 
aaeaaz0r0000111100001113 
bbbrsCbii1l110q0dg00114110N90N0 
SUmeOEOLEILiiiirijsi1i11 


Step teains © 560 ns, 
nh9=1 @ 0 

bisi 8 0 

bissc a 0 

EilS=C e C 

b1420 a 0 

bi3s0 @ 0 

eR=0 
07=90 
bé6=sn 
r5=C 
pPhi?7so 


2 
@ 
@ 
@ 


ywngaod0deon.n 


9) 


Ster becins © 570 ns, 
Emai=1 © O 


Sten becins 8 S95 ns, 
mpnhiis9 ® O 


Ster beoins ® 605 ns, 
pmhigei e 49 

state is nows 

Currert time= 630 
Clocks=Cr0ji cin=n cout=0 
aeaaazOrnonci11i100001111 
rFepr=enrodnono0icononnn4 
sumsQOrOL2111193191391914141 


mien oSeains @ 630 ns, 
phHi2=0 @ O 


Step beoins @ 640 ns, 
pniisi #@ 0 


Sten becins &€ 665 ns, 
phiisd #® O 


Scec becins @ 475 ns. 
SEnizg=i ° 0 

State is nows 

Current times 700 
Clockss0b01 cin=C coutso0 
aAaa=Ortooni1i1id9000niiit 
PHbbp=NbAQndagricanonno! 


a9 
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SURMSOR OTL 2291 Pit ty it 


Ster beoins &® 700 ns, 
phi2=C a O 


Stécwmpeoins -& 7102 ns, 
pALisi 2 0 


Ster beocins @ 735 ns, 
ohiisl @ NO 


Sten pee@ins © 745 ns, 
pni2si e oO 

s16=50 @ 14,2 

s9=0 @ 16,4 

siisO S 16,4 

si15=0 @® 16,4 

s7=>0 @ 16,4 

§$3=0 @ 16,4 

$14=0 ® 16,5 

si2=0 € 16,5 

siusoO © 16,5 

s8=0 @ 16,85 

$6=9 ®@ 16,5 

s4=0 6 16,5 

§2=0 e& 16,7 

si=C & 20 

state 1s novws 

Current times 770 
Cloekss0b0l cin=0 coursed 
a9aaa=0r00901113900001111 
bbe r=0rngcnogndg100000001 
SUPHSOFONOONLANNNNON1LOCOO 


Step becins & 770 ns, 
cin=! 7s @ 
pnhir=so B O 


Ster becins @ 780 ns, 
eniisi @® OQ 


Ster bheains @ RNE ns, 
prAiisO0 8 0 


Step beains € 815 ns, 
phi2zsi 6 0 

state is now: 

Current times 84u 
Clocks=0b01 cins! ecouts0 
a8aag=0b60900111100001111 
eErbrsOrdngdcocnicangonnoa, 
sum=db000010C000G010000 


Stec beqins @® 84C ns, 
phi2=0 2° 0 


90 
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Ster becins # #50 ns, 
phii=si # 0 


Sten bpecins @ &75 ns, 
pniisd @ 0 


Stern recins € 8858 ns, 
eni2=i 2 0 

state is nots 

Current times 916 
clocks=0bU1 cinsi coutrso 
aaaa=obr0dcn131130000I 141 
brhbr=0r0000C00100000001 
sum=0r00001000000010000 


Step beaqins ® 910 ns, 
ohi2z=0 * Q 


Ster reoins e@ 920 ns, 
phniisj @ A 


Ster heoins & 945 ns, 
phii=so * O 


Step pecins & 955 ns, 
pnhi2=!1 © 0 

Ssi=i a 19,3 

state iS now? 

Current time= 980 
Eeleexks=0b01 cin=1 cout=0 
a28aa=0r0d0n0ng111100001111 
Phbr=ordnNNdNo1CnH0CO0O! 
sum=0rnoggde100009N9010001 


Ster kecins @ 9f0 ns, 
ai6=1 23 0 
ai5s1 e 0 
aig=si ao 
ai3=1 e 0 
ab=4 
a7v=i 
a6=1 
ad5=4 
F920 
b1i=0 
ein=0 @ 0 
oni?=0 @ 0 


@pwnvnwD DOD D 
Dg Ooo D 


Ster becins @ 990 ns, 
ehiisi e 9 


meeep heoins € 1015 ns, 
pnhizy=0 a 9 


Step hecins & 1025 ns, 
ohi2?si e a 


oh 


Pee & 35:23 {1984.5 enio;g loa fade. 2 


State is nows 

Current times 1950 
eleers=Crol cein=C coutso 
asaa=NOb1r1iririririsiriit 
rbrb|eNEonoonaonarooganaa 
sum=eNecooni1anecoeodsnnel 


Step. Peains & 1020 ns. 
pnhi2=so0 a n 


Ster recins @ 10f0 ns, 
ohiisi 8 O 


Sten beocins @ 1088 ns, 
phiiso 38 0 


Sten becins #® i1N95 ns, 
pnAigz=si. © 0 

stare is now: 

Current times 112¢ 
€lceks=060! ein=ae eovurtsn 
@a@asOerysyiririsiirisiiii 
beeb=sCra00cnqnnnunog00dng 
SUMZEORONCOCINONNDNU1L9I0D) 


ster bpeoins ® 11270 As, 
pni2=c e 0 


Step peains 8 1130 ns, 
eniisi @ Oo 


Step beoins @ 1155 ns, 
phijso0 #8 O 


Step beains © 11F5 ns. 
phi2=i @ 0 

si6=1 @ 14,6 

s$9=1 @ 16,7 

siisi ® 14,7 

si5=1 @. 15,7 

Sj={, © le.) 

$321. F167 

s$i4si 8 16,8 

$i2=1 @ 16, 

si0si @ {16.8 

s8=1 @ 16,8 

s6=1 @ 16,8 

$421 @ 14,8 

s2=1 @ 17 

state is nows 

Current time= 1190 
cloeks=(CbO0i eins0 coursn 
S@aaasOrirrr1jissy1iIr1priris 
bhbbs=0n000NnN00NN00N00000N 
SumM=SQrOUT1TIit~sityiiiyii 


OE 
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Sten reains @ 1199 AS, 
cins{ a 0 
phi?=0 eB 9 


Ster bedins @ 1200 ns. 
oniisei @ Q 


Step peains #® 1225 ns, 
oniien @ O 


Sten treains @® 12735 NS, 
phi2=ji 28 Q 

state is now: 

Current times 1260 
Cloeks=s0h01 ecinss cout=0 
aaaasdeczilirpLiririlijizi1 
chbrsdRrnanunnadgrdgodnngdo 
som=eor0ti111111311111111 


Ster heqins #&® 17640 ns, 
enaz=0 FF 0 


Smee hredins @ 1279 ns, 
BSnai=1 &° Oo 


Steo resins & $2798 ns, 
pRaqg1is0 a 0 


Sten tecins © 1305 ns, 
mni2si @ 49 

state is now? 

Current times 1339 
Clocks=s0b01 cin=1 cout=0 
asassCeliL_iritzisyi1iiiiii 
bhbnsdrogqqdonanndconno0ono 
SumeOb013211311112111131 


Ster beains © 1330 ns. 
nmhi2=0 #@ 4 


Ster becins @ 1349 ns. 
oniisi e 9 


Stes pedqins © i13€5 ns, 
phii=soO e@ 0 


Step rpeoins @ 1375 ns. 
ohi2=i1 6 9 

si4=0 @ 14,2 

s9=0 @ 16,4 

silsC @ 16,4 

$1320 8 16,4 

$15=0 Aa 16,4 

eo7=. © 16,4 

s5290 © 46,4 

$3290 e@ 16.4 


ee ee ne 


Dee 6 15:23 1954 eEenic.jJ¢o Pade 10) 


St4=0.. 6 16.5 
£12=0 8a ices 
sfoso0 4°16,5 


sad=0°"a 16,5 
s6=0 @ 16.5 
s4=0 @ 16,§ 
s$2=9 8 15.7 
$1=0 @ 20 


Sout =1. "9 25,1 

state is nox? 

Current times 1400 
Clocks=0b01 ein=1 cout=i 
Seeasoelsisyz111iiiiiriiry 
Pbhbpsornonggqognanvdgodgn 
sur=a=OrinNoncnangcoonnneoa 


Ster becins © 1409 ns, 
mist 2 0 

ein=o 2 0 

oAL?=0 & OQ 


Ster beacins 8 1410 ns, 
praiisi « 


Sten heaqins @ 1435 ns, 
enlite¢ 20) 


Stern eons 6714465 ns. 
phi2=1 8 0 

state is "ows 

Current tine= 1470 
clocks=0b9! ein=0 cout=1 
@aaasCr1iii 1111112112123 
prbb=06b0N000000000000003 
sum=Nb1GOND0Q0N0CONACNC 


Steo beains € 1470 nS, 
phi?zso0 ao 


Sten pecins © 1480 ns, 
penile) eo 


Step béeoins € 1505 ns. 
phii=0 @ 0 


Stee beeins # 1515 ns. 
Onhi2z=1 8 O 

state iS now? 

Current time= 1540 
Clocks=(b01 cin=C coutel 
Baae=OoiisviisiLiiisiLriti 
bhbhsorfoqg00onranog0gnoi 
sum=0b100N09000000009900 


Ster bheains #* 1549 ns, 
NAL7=0 e@ A 
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ster bpeqins & 1550 ns, 
ohi1isi @ 0 


Steer peacins #& 1578 ns. 
onilist & 0 


Ster pecins @ 1585 ns. 
phi2=1 @ 0 

state is mow; 

Current times 1610 
Clocks=Nh01 ecin=0 coutel 
esaag=Nriii11111111411111 
bben=CEeNnangdgonnnoonnodl 
sum=0r1d00nqn0qd00Nncnto0400 


Stem recins «= 1619 ns, 
biscd @ 0 
ohi2?s0 « 0 


Ster reaqins @ 1620 ns, 
pniizi « O 


Steo necins @ 16485 ns, 
phasjish 2@ 0 


Sten rmeains © 1655 ns, 
pnhizsi @ 9 

state is now: 

Current times 1€80 
clocks=Ob91 ecjsn=0 coutel 
SeSsme=OP LILI ti1iiiiti 
bebr=sOCrnonng0ngnnoddgngna 
sumenbiOo0ocgennrnngngnd 


Seaee begins @ 160 ns. 
ai6és0 mA 

@i5=0 e 
ail4=n e@ 
ai3=0 e 
@ai2=¢C @ 
aiis0O e 
ain=0 e 
a9=0 
a&=0 
a7=0 
a6=0 
a5=0 
a4=0 
a3=9 
a2=0 
aicQ 
pni2=o 


oao90 0920 0 


DPOoaD?D DD D ® 
anoaoroo0oqo:a9a 


0 


Ster peains @ 1699 ns, 
Paiisi @n 


a 


apg 4 £ te— 


Mee 6 15273 19F 4 enie.loe Pacer: 


Step beains @ $715 ns, 
ohiiso e@ 0 


Ster nheains & 1725 ns, 
enj2s1 4 0 

state is naws 

Gurrene tires 1759 
eleexss0r0!l cinst cout=i 
aaaa=0r006000000N000090 
PhRPBSORNNONCOOCINCNACNCOO 
sumsder1Cungncdocoontnodnd 


Ster neains & 17580 ns, 
rt6s1 @ 
piSsi e 
bi4et 6 
bils=1 3 
P1221 @ 
hasta 
bidsi ¢@ 
Posi 
bB=1 
n7si 
hes 
b5=1 
hAe] 
b3=1 
K2=1 
his} 
eoni2=4 


a gO 9090 009 


D> D DA YD DOS D PN 
areodorvDvoasdrr29 3 


Oo 


Ster beains # 1760 ns, 
Paiist ©) 


Steo heains ® 1785 ns, 
ohiis0 @ 0 


Stec fecins © 1795 ns. 
eni2si1 ° 0 
s16=1 @ 14,4 
S9o-1 <> 16,7 
Slisl #1627 
Sigel 6716.7 
Sis=i 28 16.7 
s7=i @ 16,7 
s5=1 ® 16,7 
s3=1 @ 16,7 
$i14s1 6 14,8 
s$12=1 @ 16.8 
si0si1 @ 16,8 


$R=i @ 36,8 
sf=l e@ 16,6 
s4=i1 ° 16,8 
$2=1 e 17 

sizi ¢ 19,1 


Dec 6 18:23 1984 ec¢hip,loc Pande 13 


ecout=c F&F 22.9 

state js now: 

Current times 1220 
Clocks=0F01 cin=f coutsd 
aaaa=0rocoo0ngcaonconntrgnnao 
Pemee=orliyisiszilisii1is: 
eomecrOliisiti1i11111111 


Stern becins @ 1820 ns, 
ai7si 2 0 
oni2=0 @& 0 


Ster beaoins & 1630 ns, 
Moiese: ©& 6 


Ster necqins ® 1855 ns, 
eniiso @& Oo 


Stec becins @ 1865 ns, 
OnLiP=si 2 9 

s16=0 @ 14,2 

s9=0 #@ 16,4 

s$13=0 @ 16,4 

s1i3=0 # 14,4 

s15=0 4 14,4 

s7=0 8 16,4 

s5=0 © 16,4 

$3=0 9 16,4 

si14=s0 @ 46,5 

gsi2=0 eP 16,5 

s10=0 @ 16,5 

s8=eQ @ 36,5 

s6=C @ 16,5 

s4=0 € 16.5 

e2=0 9 16,7 

si=0Q @ 270 

state is now: 

Currert tires 18699 
erecks=OF0!) ¢cir=0 couts0 
aaaa=Onnondgidoannqgoconooa 
mor = OPLVi1iiiii1iii1i1i1i 
SsUmM=SDDODODONNDDDINNNLCOND 


Ster reocins @ 18979 ns, 
b1i2=0 @ 0 
pni2=0 a Q 


Step vredins #& 1909 ns, 
pnhiisi @ 0 


Ster hecins 8 1925 ns, 
Phii=o Re 0 


Step beains ® 1935 ns, 


pni2=ss e@ 0 
sli6=1 @ 14,6 


oY 
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s9=i1 8 16,7 
siisi @ 16,7 
Sis=i © 16.7 
SiS=i a) 6.7 
s7=1/ 2 16,7 
s5=1 e@ 16,7 
s3=1 e@ 16,7 
$idsi1 @ 16, 
Siz=i 6 15,8 
$1051 8 16.8 


s8=i1 2@ 16,8 
sh=1 @ 16,8 
s4si a 16.8 
s?=1 # 17 

Bl=i"ye" 19,1 


state is now: 

Currert time= 1960 
Ccloeks=0b0!1 cin=0 couten 
aaas=Oendcon1connognognogy 
HhPoSOr is 107 71 Pl iis 
eumsORoti tity tity i114 


Stem peains & 1960 Ns, 
Cinei a0 
pni2=0 @ 0 


St@e reains & 193970 ns, 
oniisi 2 0 


Stem hecins @ 1998 ns, 
goniiso ©« QO 


Sten recins @€ 2005 ns, 
Oni z= ies i 

$16=0 @ 14,2 

si13=0 ® 16,4 

slos(ser 16.4 

si14s¢C @ 16,5 

s$12=0 @ 16,5 

Gouts!]) ©7214 

state is now: 

Current time= 2030 
eleocks=0eE0l cinr=) couvre! 
aaae=O0roonge1d000000NnNNNNDN 
BEPESOOTIITOT Pi iii i 
sums0b10000011111111111 


Step beains e 2030 ns, 
bi6=s0 @ 
riSs0 
ni4é=0 
bi3sod 
b1is=0 
BP O=9 
p92 e@ Nn 
pR=0 e@ 0 


DDD D D 
oo 90 Oo CoO 


a3 
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F720 2 0 
Hn6s0 @ 0 
hS=0 24 0 
b4ef we 0 
n3=s90 B@ 0 
6b2s0 e GQ 
b1=0 @ Q 
ai?7s¢ @ fn 
ohi2=0 4 0 


Stec hecins e& 204C ns, 
pniisi # O 


Sten henins © 2965 ns, 
Beas e@ O 


Step beairs * 2075 ns. 
pni2=1i 6 0 

§l6=i € 14,4 

si3si ¢ 16.7 

eel ©€ 16.7 

si4=1 @ 16,8 

si2=1 € 16.8 

couts0 #& 22.9 

state ise now; 

Current times 2109 
erecks=O0t01l cin=i eovutso0 
aaaa=OrntoocnnGgtnod000nn 
roprsdorcond00nnncodo0nds 
eur =Ob0L911111111111111 
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eis=o0 a 0 
eni2z=0 @ O 
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Ster pecins @ 2135 ns, 
phiiso e 0 
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B2i7=i e 0 
$16=0 @ 14,2 
s$9=0 @ 16,4 
$11=0 ® Deiei4 
si3=0 @ 16,4 
siS=0 2 16,4 
s7=0 a 16,4 
s5=N e@ 16,4 
s3=0 8&8 16,4 
si4=0 e@ 16,5 
$1720 e@ 16,5 
si0sn @ 16,5 
sa&s0 =) 16,5 
s6=0 © 16,5 
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s4=0 fa ion oS 

¢2=0 © 16,7 

st=0¢ 20 
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state is now: 

Current time=s 2170 
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sumsdpnicrnodo0o0otcnnnginnd 
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oPpitesi $< 0 
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cniiso e€ 0 
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enir?=1 # 0 

coutso @& 22,93 

state ic now 

Current times 2240 
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Steer reains &® 2240 ns, 
mni2=0 # O 
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cloekseOh0)i cin=0 couts0 
aaaa=0Lon0nngqgnadndoncoodn 
bbbb=0h0000000000000000 
sumedpndQ0coddd00qg00ngconroonud 


Stee bpeeins @ 2310 ns, 
phi2=0 8 0 


Stec bpecins eR 2320 ns, 
ohniisi @¢ 0 


Ster vecins #® 2345 ns, 


Foo 
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pecs 6 15222 1964 erie l.loc Mace 17 


onit=o F&F 0 


Stee becins @ 2355 ns. 
oni2=s1 @ 

state is now; 

Current times 23890 
ereews=O0Ord! cin=sOmeout=nh 
aaea=Orocnoncorocoqdoqnae 
DoErSsOLECdNNNCOoCOnVOONCO 
sum=eNBQd0ONNKnaAndnargognno 
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